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Description 
BACKGROUND 

5 [0001] This application claims the benefit of U.S Provisional Application No.60/368,798 filed on March 29, 2002. 
[00021 This invention relates to diagnostics and prognostics for colorectal cancer based on the gene expression 
profiles of biological samples. 

[0003] Colorectal cancer is a heterogenous disease, consisting of tumors thought to emerge through three major 
molecular mechanisms: 1) mutations in the adenomatous polyposis coli (APC) gene, or the p-catenin gene, combined 

10 with chromosomal instability, 2) mutations in DNA mismatch repair genes, such as MLH1 , MSH2, PMS1 , PMS2 and 
MSH6, associated with microsatellite instability and mutations in genes containing short repeats, and 3) gene silencing 
induced by hypermethylation of the promoter regions of tumor suppressor genes. The genetic complement of individual 
colorectal cancers is likely to include different combinations of genetic instability, specific mutations, and gene silencing. 
Chromosomal instability (CIN) is a common feature of cancers in general. It implies an aneuploid phenotype, in which 

is whole chromosomes or large parts of them are being lost or gained. Microsomal instability (MIN) is found in diploid 
tumors with an increased mutation rate in short repeats. Both forms of genetic instability are common in colorectal 
cancer. 

[0004] Colorectal cancers thus have complex origins and involve a number of interactions in different biological 
pathways. Serum markers, histological, and cytological examinations historically used to assist in providing diagnostic, 
20 prognostic, or therapy monitoring decisions often do not have desired reliability. Likewise, while use of a single genetic 
marker (e.g., increased expression of a particular gene) may be beneficial, the diversity of the cancers make it more 
likely that a portfolio of genetic markers is the best approach. 

SUMMARY OF THE INVENTION 

25 

[0005] The invention is a method of assessing the presence or absence of colorectal cancer or the likely condition 
of a person believed to have colorectal cancer. In the method, a gene expression profile of a patient sample is analyzed 
to determine whether a patient has a colorectal cancer, whether a patient does not have colorectal cancer, whether a 
patient is likely to get colorectal cancer, or the response to treatment of a patient being treated for colorectal cancer. 
30 [0006] Articles used in practising the methods are also an aspect of the invention. Such articles include gene ex- 
pression profiles or representations of them that are fixed in machine-readable media such as computer readable 
media. 

[0007] Articles used to identify gene expression profiles can also include substrates or surfaces, such as microarrays, 
to capture and/or indicate the presence, absence, or degree of gene expression. 

35 

DETAILED DESCRIPTION 

[0008] The mere presence or absence of particular nucleic acid sequences in a tissue sample has only rarely been 
found to have diagnostic or prognostic value. Information about the expression of various proteins, peptides or mRNA, 

40 on the other hand, is increasingly viewed as important. The mere presence of nucleic acid sequences having the 
potential to express proteins, peptides, or mRNA ( such sequences referred to as "genes") within the genome by itself 
is not determinative of whether a protein, peptide, or mRNA is expressed in a given cell. Whether or not a given gene 
capable of expressing proteins, peptides, or mRNA does so and to what extent such expression occurs, if at all, is 
determined by a variety of complex factors. Irrespective of difficulties in understanding and assessing these factors, 

45 assaying gene expression can provide useful information about the occurrence of important events such as tumero- 
genesis, metastasis, apoptosis, and other clinically relevant phenomena. Relative indications of the degree to which 
genes are active or inactive can be found in gene expression profiles. The gene expression profiles of this invention 

are used to diagnose and treat patients for colorectal cancer. j 
[0009] Sample preparation requires the collection of patient samples. Patient samples used in the inventive method 

so are those that are suspected of containing diseased cells such as epithelial cells taken from a colon sample or from 
surgical margins. One useful technique for obtaining suspect samples is Laser Capture Microdisection (LCM). LCM 
technology provides a way to select the cells to be studied, minimizing variability caused by cell type heterogeneity. 
Consequently, moderate or small changes in gene expression between normal and cancerous cells can be readily 
detected. In a preferred method, the samples comprise circulating epithelial cells extracted from peripheral blood. 

55 These can be obtained according to a number of methods but the most preferred method is the magnetic separation 
technique described in U.S. Patent 6,1 36,1 82 assigned to Immunivest Corp which is incorporated herein by reference. 
Once the sample containing the cells of interest has been obtained, RNA is extracted and amplified and a gene ex- 
pression profile is obtained, preferably via micro-array, for genes in the appropriate portfolios. 



2 



EP 1 355 1 49 A2 



[0010] Preferred methods for establishing gene expression profiles include determining the amount of RNA that is 
produced by a gene that can code for a protein or peptide. This is accomplished by reverse transcriptase PCR 
(RT-PCR), competitive RT-PCR, real time RT-PCR, differential display RT-PCR, Northern Blot analysis and other re- 
lated tests. While it is possible to conduct these techniques using individual PCR reactions, it is best to amplify com- 

5 plimentary DNA (cDNA) or complimentary RNA (cRNA) produced from mRNA and analyze it via microarray. A number 
of different array configurations and methods for their production are known to those of skill in the art and are described 
in U.S. Patents such as: 5,445,934; 5,532,128; 5,556,752; 5,242,974; 5,384,261 ; 5,405,783; 5,412,087; 5,424,186; 
5,429,807; 5,436,327; 5,472,672; 5,527,681; 5,529,756; 5,545,531; 5,554,501; 5,561,071; 5,571,639; 5,593,839; 
5,599,695; 5,624,711; 5,658,734; and 5,700,637; the disclosures of which are incorporated herein by reference. 

10 [0011] Microarray technology allows for the measurement of the steady-state mRNA level of thousands of genes 
simultaneously thereby presenting a powerful tool for identifying effects such as the onset, arrest, or modulation of 
uncontrolled cell proliferation. Two microarray technologies are currently in wide use. The first are cDNA arrays and 
the second are oligonucleotide arrays. Although differences exist in the construction of these chips, essentially all 
downstream data analysis and output are the same. The product of these analyses are typically measurements of the 

15 intensity of the signal received from a labeled probe used to detect a cDNA sequence from the sample that hybridizes 
to a nucleic acid sequence at a known location on the microarray. Typically, the intensity of the signal is proportional 
to the quantity of cDNA, and thus mRNA, expressed in the sample cells. A large number of such techniques are 
available and useful. Preferred methods for determining gene expression can be found in US Patents 6,271 ,002 to 
Linsley, et al.; 6,218,122 to Friend, et al.; 6,218,114 to Peck, et al.; and 6,004,755 to Wang, et al., the disclosure of 

20 each of which is incorporated herein by reference. 

[0012] Analysis of the expression levels is conducted by comparing such intensities. This is best done by generating 
a ratio matrix of the expression intensities of genes in a test sample versus those in a control sample. For instance, 
the gene expression intensities from a diseased tissue can be compared with the expression intensities generated 
from normal tissue of the same type (e.g., diseased colon tissue sample vs. normal colon tissue sample). A ratio of 

25 these expression intensities indicates the fold-change in gene expression between the test and control samples. 

[0013] Gene expression profiles can also be displayed in a number of ways. The most common method is to arrange 
a raw fluorescence intensities or ratio matrix into a graphical dendogram where columns indicate test samples and 
rows indicate genes. The data is arranged so genes that have similar expression profiles are proximal to each other. 
The expression ratio for each gene is visualized as a color. For example, a ratio less than one (indicating down-regu- 

30 lation) may appear in the blue portion of the spectrum while a ratio greater than one (indicating up-regulation) may 
appear as a color in the red portion of the spectrum. Commercially available computer software programs are available 
to display such data including "GENESPRING" from Silicon Genetics, Inc. and "DISCOVERY" and "INFER" software 
from Partek, Inc. 

[0014] Modulated genes used in the methods of the invention are shown in Table 1 . The genes that are differentially 
35 expressed are shown as being either up regulated or down regulated in diseased cells. Up regulation and down reg- 
ulation are relative terms meaning that a detectable difference (beyond the contribution of noise in the system used to 
measure it) is found in the amount of expression of the genes relative to some baseline. In this case, the baseline is 
the measured gene expression of a normal cell. The genes of interest in the diseased cells are then either up regulated 
or down regulated relative to the baseline level using the same measurement method. Diseased, in this context, refers 
40 to an alteration of the state of a body that interrupts or disturbs, or has the potential to disturb, proper performance of 
bodily functions as occurs with the uncontrolled proliferation of cells. Someone is diagnosed with a disease when some 
aspect of that person's genotype or phenotype is consistent with the presence of the disease. However, the act of 
conducting a diagnosis or prognosis includes the determination disease/status issues such as therapy monitoring. In 
therapy monitoring, clinical judgments are made regarding the effect of a given course of therapy by comparing the 
45 expression of genes over time to determine whether the gene expression profiles have changed or are changing to 
patterns more consistent with normal tissue. 

[0015] Preferably, levels of up and down regulation are distinguished based on fold changes of the intensity meas- 
urements of hybridized microarray probes. A 2.0 fold difference is preferred for making such distinctions or a p-value 
less than .05. That is, before a gene is said to be differentially expressed in diseased versus normal cells, the diseased 

50 cell is found to yield at least 2 more, or 2 times less intensity than the normal cells. The greater the fold difference, the 
more preferred is use of the gene as a diagnostic. Genes selected for the gene expression profiles of the instant 
invention have expression levels that result in the generation of a signal that is distinguishable from those of the normal 
or non-modulated genes by an amount that exceeds background using clinical laboratory instrumentation. 
[0016] Statistical values can be used to confidently distinguish modulated from non-modulated genes and noise. 

55 Statistical tests find the genes most significantly different between diverse groups of samples. The Student's t-test is 
an example of a robust statistical test that can be used to find significant differences between two groups. The lower 
the p-value, the more compelling the evidence that the gene is showing a difference between the different groups. 
Nevertheless, since microarrays measure more than one gene at a time, tens of thousands of statistical tests may be 
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asked at one time. Because of this, there is likelihood to see small p-values just by chance and adjustments for this 
using a Sidak correction as well as a randomization/permutation experiment can be made. A p-value less than .05 by 
the t-test is evidence that the gene is significantly different. More compelling evidence is a p-value less then .05 after 
the Sidak correct is factored in. For a large number of samples in each group, a p-value less than 0.05 after the 

5 randomization/permutation test is the most compelling evidence of a significant difference. 

[0017] Another parameter that can be used to select genes that generate a signal that is greater than that of the 
non-modulated gene or noise is the use of a measurement of absolute signal difference. Preferably, the signal generated 
by the modulated gene expression is at least 20% different than those of the normal or non-modulated gene (on an 
absolute basis). It is even more preferred that such genes produce expression patterns that are at least 30% different 

10 than those of normal or non-modulated genes. 

[0018] Genes can be grouped so that information obtained about the set of genes in the group provides a sound 
basis for making a clinically relevant judgment such as a diagnosis, prognosis, or treatment choice. These sets of 
genes make up the portfolios of the invention. In this case, the judgments supported by the portfolios involve colorectal 
cancer. Portfolios of gene expression profiles can be comprised of combinations of genes described in Example 3. As 

15 with most diagnostic markers, it is often desirable to use the fewest number of markers sufficient to make a correct 
medical judgment. This prevents a delay in treatment pending further analysis as well inappropriate use of time and 
resources. In this case, such a minimal portfolio can be comprised of a combination of genes from Example 4. 
[0019] Preferably, portfolios are established such that the combination of genes in the portfolio exhibit improved 
sensitivity and specificity relative to individual genes or randomly selected combinations of genes. In the context of the 

20 instant invention, the sensitivity of the portfolio can be reflected in the fold differences exhibited by a gene's expression 
in the diseased state relative to the normal state. Specificity can be reflected in statistical measurements of the corre- 
lation of the signaling of gene expression with the condition of interest. For example, standard deviation can be a used 
as such a measurement. In considering a group of genes for inclusion in a portfolio, a small standard deviation in 
expression measurements correlates with greater specificity. Other measurements of variation such as correlation 

25 coefficients can also be used in this capacity. The most preferred method of establishing gene expression portfolios is 
through the use of optimization algorithms such as the mean variance algorithm widely used in establishing stock 
portfolios. This method is described in detail in the co-pending patent application entitled "Portfolio Selection" by Tim 
Jatkoe, et. al., of equal date hereto. Essentially, the method calls for the establishment of a set of inputs (stocks in 
financial applications, expression as measured by intensity here) that will optimize the return (e.g., signal that is gen- 

30 erated) one receives for using it while minimizing the variability of the return. Many commercial software programs are 
available to conduct such operations. "Wagner Associates Mean-Variance Optimization Application", referred to as 
"Wagner Software" throughout this specification, is preferred. This software uses functions from the "Wagner Associ- 
ates Mean- Variance Optimization Library" to determine an efficient frontier and optimal portfolios in the Markowitz 
sense is preferred. 

35 [0020] Use of this type of software requires that microarray data be transformed so that it can be treated as an input 
in the way stock return and risk measurements are used when the software is used for its intended financial analysis 
purposes. For example, when Wagner Software is employed in conjunction with microarray intensity measurements 
the following data transformation method is employed. 

[0021] Genes are first pre-selected by identifying those genes whose expression shows at least some minimal level 
40 of differentiation. The preferred pre-selection process is conducted as follows. A baseline class is selected. Typically, 
this will comprise genes from a population that does not have the condition of interest. For example, if one were inter- 
ested in selecting a portfolio of genes that are diagnostic for breast cancer, samples from patients without breast cancer 
can be used to make the baseline class. Once the baseline class is selected, the arithmetic mean and standard deviation 
is calculated for the indicator of gene expression of each gene for baseline class samples. This indicator is typically 
45 the fluorescent intensity of a microarray reading. The statistical data computed is then used to calculate a baseline 
value of (X*Standard Deviation + Mean) for each gene. This is the baseline reading for the gene from which all other 
samples will be compared. X is a stringency variable selected by the person formulating the portfolio. Higher values 
of X are more stringent than lower. Preferably, X is in the range of .5 to 3 with 2 to 3 being more preferred and 3 being 
most preferred. 

50 [0022] Ratios between each experimental sample (those displaying the condition of interest) versus baseline read- 
ings are then calculated. The ratios are then transformed to base 10 logarithmic values for ease of data handling by 
the software. This enables down regulated genes to display negative values necessary for optimization according to 
the Markman mean-variance algorithm using the Wagner Software. 

[0023] The preprocessed data comprising these transformed ratios are used as inputs in place of the asset return 
55 values that are normally used in the Wagner Software when it is used for financial analysis purposes. 

[0024] Once an efficient frontier is formulated, an optimized portfolio is selected for a given input level (return) or 
variance that corresponds to a point on the frontier. These inputs or variances are the predetermined standards set by 
the person formulating the portfolio. Stated differently, one seeking the optimum portfolio determines an acceptable 
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input level (indicative of sensitivity) or a given level of variance (indicative of specificity) and selects the genes that lie 
along the efficient frontier that correspond to that input level or variance. The Wagner Software can select such genes 
when an input level or variance is selected. It can also assign a weight to each gene in the portfolio as it would for a 
stock in a stock portfolio. 

5 [0025] Determining whether a sample has the condition for which the portfolio is diagnostic can be conducted by 
comparing the expression of the genes in the portfolio for the patient sample with calculated values of differentially 
expressed genes used to establish the portfolio. Preferably, a portfolio value is first generated by summing the multiples 
of the intensity value of each gene in the portfolio by the weight assigned to that gene in the portfolio selection process. 
A boundary value is then calculated by (Y*standard deviation + mean of the portfolio value for baseline groups) where 

10 Y is a stringency value having the same meaning as X described above. A sample having a portfolio value greater 
than the portfolio value of the baseline class is then classified as having the condition. If desired, this process can be 
conducted iteratively in accordance with well known statistical methods for improving confidence levels. 
[0026] Optionally one can reiterate this process until best prediction accuracy is obtained. 
[0027] The process of portfolio selection and characterization of an unknown is summarized as follows: 

15 

1 . Choose baseline class. 

2. Calculate mean, and standard deviation of each gene for baseline class samples. 

3. Calculate (X*Standard Deviation + Mean) for each gene. This is the baseline reading from which all other samples 
will be compared. X is a stringency variable with higher values of X being more stringent than lower. 

20 4. Calculate ratio between each Experimental sample versus baseline reading calculated in step 3. 

5. Transform ratios such that ratios less than 1 are negative (eg.using Log base 10). (Down regulated genes now 
correctly have negative values necessary for MV optimization). 

6. These transformed ratios are used as inputs in place of the asset returns that are normally used in the software 
application. 

25 7. The software will plot the efficient frontier and return an optimized portfolio at any point along the efficient frontier. 

8. Choose a desired return or variance on the efficient frontier. 

9. Calculate the Portfolio's Value for each sample by summing the multiples of each gene's intensity value by the 
weight generated by the portfolio selection algorithm. 

10. Calculate a boundary value by adding the mean Portfolio Value for Baseline groups to the multiple of Y and 
30 the Standard Deviation of the Baseline's Portfolio Values. Values greater than this boundary value shall be classified 

as the Experimental Class. 

11. Optionally one can reiterate this process until best prediction accuracy is obtained. 

[0028] Alternatively, genes can first be pre-selected by identifying those genes whose expression shows some min- 
35 imal level of differentiation. The pre-selection in this alternative method is preferably based on a threshold given by 



1 < 



where \i t is the mean of the subset known to possess the disease or condition, u.„ is the mean of the subset of normal 
samples, and o t + a n represent the combined standard deviations. A signal to noise cutoff can also be used by pre- 
selecting the data according to a relationship such as 



0.5 < 



[\i t - MAX n ) 



This ensures that genes that are pre-selected based on their differential modulation are differentiated in a clinically 
significant way. That is, above the noise level of instrumentation appropriate to the task of measuring the diagnostic 
parameters. For each marker pre-selected according to these criteria, a matrix is established in which columns repre- 
sents samples, rows represent markers and each element is a normalized intensity measurement for the expression 
of that marker according to the relationship: 



Qjf-0 
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where I is the intensity measurement. 

[0029] It is also possible to set additional boundary conditions to define the optimal portfolios. For example, portfolio 
size can be limited to a fixed range or number of markers. This can be done either by making data pre-selection criteria 
more stringent (e.g, 



.8< 



10 instead of 



0.5 < 



15 



(\i r MAX n ) 



or by using programming features such as restricting portfolio size. One could, for example, set the boundary condition 
that the efficient frontier is to be selected from among only the most optimal 10 genes. One could also use all of the 
genes pre-selected for determining the efficient frontier and then limit the number of genes selected (e.g., no more 
than 10). 

20 [0030] The process of selecting a portfolio can also include the application of heuristic rules. Preferably, such rules 
are formulated based on biology and an understanding of the technology used to produce clinical results. More pref- 
erably, they are applied to output from the optimization method. For example, the mean variance method of portfolio 
selection can be applied to microarray data for a number of genes differentially expressed in subjects with breast 
cancer. Output from the method would be an optimized set of genes that could include some genes that are expressed 

25 in peripheral blood as well as in diseased breast tissue. If sample used in the testing method are obtained from pe- 
ripheral blood and certain genes differentially expressed in instances of breast cancer could also be differentially ex- 
pressed in peripheral blood, then a heuristic rule can be applied in which a portfolio is selected from the efficient frontier 
excluding those that are differentially expressed in peripheral blood. Of course, the rule can be applied prior to the 
formation of the efficient frontier by, for example, applying the rule during data pre-selection. 

30 [0031 ] Other heuristic rules can be applied that are not necessarily related to the biology in question. For example, 
one can apply the rule that only a given percentage of the portfolio can be represented by a particular gene or genes. 
Commercially available software such as the Wagner Software readily accommodates these types of heuristics. This 
can be useful, for example, when factors other than accuracy and precision (e.g., anticipated licensing fees) have an 
impact on the desirability of including one or more genes. 

35 [0032] One method of the invention involves comparing gene expression profiles for various genes (or portfolios) to 
conduct diagnoses as described above. The gene expression profiles of each of the genes comprising the portfolio 
are fixed in a medium such as a computer readable medium. This can take a number of forms. For example, a table 
can be established into which the range of signals (e.g., intensity measurements) indicative of disease is input. Actual 
patient data can then be compared to the values in the table to determine whether the patient samples are normal or 

40 diseased. In a more sophisticated embodiment, patterns of the expression signals (e.g., flourescent intensity) are 
recorded digitally or graphically. The gene expression patterns from the gene portfolios used in conjunction with patient 
samples are then compared to the expression patterns. Pattern comparison software can then be used to determine 
whether the patient samples have a pattern indicative of the disease in question. Of course, these comparisons can 
also be used to determine whether the patient results are normal. The expression profiles of the samples are then 

45 compared to the portfolio of a normal or control cell. If the sample expression patterns are consistent with the expression 
pattern for a colorectal cancer then (in the absence of countervailing medical considerations) the patient is diagnosed 
as positive for colorectal cancer. If the sample expression patterns are consistent with the expression pattern from the 
normal/control cell then the patient is diagnosed negative for colorectal cancer. 

[0033] Numerous well known methods of pattern recognition are available. The following references provide some 
50 examples: 



Weighted Voting: Golub, TR., Slonim, DK., Tamaya, P., Huard, C, Gaasenbeek, M., Mesirov, JP., Coller, H., Loh, 
L, Downing, JR., Caligiuri, MA., Bloomfield, CD., Lander, ES. Molecular classification of cancer: class discovery 
and class prediction by gene expression monitoring. Science 286:531 -537, 1 999 
55 Support Vector Machines: Su, Al., Welsh, JB., Sapinoso, LM., Kern, SG., Dimitrov, P., Lapp, H., Schultz, PG., 

Powell, SM., Moskaluk, CA., Frierson, HF. Jr., Hampton, GM. Molecular classification of human carcinomas by 
use of gene expression signatures. Cancer Research 61 :7388-93, 2001 and 

Ramaswamy, S., Tamayo, P., Rifkin, R., Mukherjee, S., Yeang, CH., Angelo, M., Ladd, O, Reich, M., Latu- 
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lippe, E., Mesirov, JR, Poggio, T., Gerald, W., Loda, M., Lander, ES., Gould, TR. Multiclass cancer diagnosis using 
tumor gene expression signatures Proceedings of the National Academy of Sciences of the USA 98:1 51 49-1 51 54, 
2001 

K-nearest Neighbors: Ramaswamy, S., Tamayo, P., Rifkin, R., Mukherjee, S., Yeang, CH., Angelo, M., Ladd, C, 
5 Reich, M., Latulippe, E. ( Mesirov, JR, Poggio, T., Gerald, W., Loda, M., Lander, ES., Gould, TR. Multiclass cancer 

diagnosis using tumor gene expression signatures Proceedings of the National Academy of Sciences of the USA 
98:15149-15154, 2001 

Correlation Coefficients: van 't Veer LJ, Dai H, van de Vijver MJ, He YD, Hart AA, Mao M, Peterse HL, van der 
Kooy K, Marton MJ, Witteveen AT, Schreiber GJ, Kerkhoven RM, Roberts C, Linsley PS, Bernards R, Friend SH. 
10 Gene expression profiling predicts clinical outcome of breast cancer. Nature. 2002 Jan 31 ;41 5(6871 ):530-6. 

[0034] The gene expression profiles of this invention can also be used in conjunction with other non-genetic diag- 
nostic methods useful in cancer diagnosis, prognosis, or treatment monitoring. For example, in some circumstances 
it is beneficial to combine the diagnostic power of the gene expression based methods described above with data from 

15 conventional markers such as serum protein markers (e.g., carcinoembryonic antigen). A range of such markers exists 
including such analytes as CA19-9, CA 125, GK-BB, and Guanylyl Cyclase C. In one such method, blood is periodically 
taken from a treated patient and then subjected to an enzyme immunoassay for one of the serum markers described 
above. When the concentration of the marker suggests the return of tumors or failure of therapy, a sample source 
amenable to gene expression analysis is taken. Where a suspicious mass exists, a fine needle aspirate is taken and 

20 gene expression profiles of cells taken from the mass are then analyzed as described above. Alternatively, tissue 
samples may be taken from areas adjacent to the tissue from which a tumor was previously removed. This approach 
can be particularly useful when other testing produces ambiguous. 

[0035] Combining the use of genetic markers with other diagnostics is most preferred when the reliability of the other 
diagnostic is suspect. For example, it is known that serum levels of CEA can be substantially affected by factors having 

25 nothing to do with a patient's cancer status. It can be beneficial to conduct a combination gene express ion/CE A assay 
when a patient being monitored following treatment for colon cancer shows heightened levels of routine CEA assays. 
[0036] Articles of this invention include representations of the gene expression profiles useful fortreating, diagnosing, 
prognosticating, and otherwise assessing diseases. These profile representations are reduced to a medium that can 
be automatically read by a machine such as computer readable media (magnetic, optical, and the like). The articles 

30 can also include instructions for assessing the gene expression profiles in such media. For example, the articles may 
comprise a CD ROM having computer instructions for comparing gene expression profiles of the portfolios of genes 
described above. The articles may also have gene expression profiles digitally recorded therein so that they may be 
compared with gene expression data from patient samples. Alternatively, the profiles can be recorded in different 
representational format. A graphical recordation is one such format. Clustering algorithms such as those incorporated 

35 jn "GEN ESPR ING" and "DISCOVE R" computer programs mentioned above can best assist in the visualization of such 
data. 

[0037] Different types of articles of manufacture according to the invention are media or formatted assays used to 
reveal gene expression profiles. These can comprise, for example, microarrays in which sequence complements or 
probes are affixed to a matrix to which the sequences indicative of the genes of interest combine creating a readable 
40 determinant of their presence. Alternatively, articles according to the invention can be fashioned into reagent kits for 
conducting hybridization, amplification, and signal generation indicative of the level of expression of the genes of in- 
terest for detecting colorectal cancer. 

[0038] Kits made according to the invention include formatted assays for determining the gene expression profiles. 
These can include all or some of the materials needed to conduct the assays such as reagents and instructions. 

45 [0039] The invention is further illustrated by the following non-limiting examples. 

Examples: Genes analyzed according to this invention are identified by reference to Gene ID Numbers in the GenBank 
database. These are typically related to full-length nucleic acid sequences that code for the production of a protein or 
peptide. One skilled in the art will recognize that identification of full-length sequences is not necessary from an ana- 
lytical point of view. That is, portions of the sequences or ESTs can be selected according to well-known principles for 

50 which probes can be designed to assess gene expression for the corresponding gene. 

Example 1- Sample Handling and LCM. 

[0040] Twenty-seven fresh frozen tissue samples were collected from patients who had surgery for a colorectal tumor. 
55 Nineteen of the samples were colorectal malignancy specimens, and eight of the samples were of normal colon mucosa. 
The tissues were snap frozen in liquid nitrogen within 20-30 minutes of harvesting, and stored at -80C° thereafter. For 
laser capture, the samples were cut (6jim), and one section was mounted on a glass slide, and the second on film (P. 
A.L.M.), which had been fixed onto a glass slide (Micro Slides Colorfrost, VWR Scientific, Media, PA). The section 
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mounted on a glass slide was after fixed in cold acetone, and stained with Mayer's Haematoxylin (Sigma, St. Louis, 
MO). A pathologist analyzed the samples for diagnosis and grade. The clinical stage was estimated from the accom- 
panying surgical pathology and clinical reports, using the Dukes classification. The section mounted on film was after 
fixed for five minutes in 100% ethanol, counter stained for 1 minute in eosin/100% ethanol ("lOO.ug of Eosin in 100ml 

5 of dehydrated ethanol), quickly soaked once in 100% ethanol to remove the free stain, and air dried for 10 minutes. 
[0041] Two of the colorectal adenocarcinomas were of grade 1 , 1 0 of grade 2, and 5 of grade 3. One of the malignant 
samples was a carcinoid tumor of the caecum, and one a metastatic melanoma lesion. Two of the adenocarcinoma 
samples represented the mucinous subtype, and one the signet cell subtype. The Dukes staging of the adenocarci- 
nomas divided them as follows: Dukes A: 2, Dukes B: 5, Dukes C: 7, Dukes D: 3. Six of the adenocarcinomas had 

10 been Irradiated preoperatively. 

[0042] Before use in LCM, the membrane (LPC-MEMBRANE PEN FOIL 1 .35 urn No 81 00, P.A.L.M. GmbH Mikrolaser 
Technologie, Bernried, Germany) and slides were pretreated to abolish RNases, and to enhance the attachment of 
the tissue sample onto the film. Briefly, the slides were washed in DEP H 2 0, and the film was washed in RNase AWAY 
(Molecular Bioproducts, Inc., San Diego, CA) and rinsed in DEP H 2 0. After attaching the film onto the glass slides, 

is the slides were baked at +1 20°C for 8 hours, treated with TI-SAD (Diagnostic Products Corporation, Los Angeles, CA, 
1 :50 in DEP H 2 0, filtered through cotton wool), and incubated at +37°C for30 minutes. Immediately before use, a 1 0\i\ 
aliquot of RNase inhibitor solution (Rnasin Inhibitor 2500U=33U/nl N211A, Promega GmbH, Mannheim, Germany, 
0.5\i\ in 400uJ of freezing solution, containing 0.15 mol NaCI, 10 mmol Tris pH 8.0, 0.25 mmol dithiothreitol) was spread 
onto the film, where the tissue sample was to be mounted. 

20 [0043] The tissue sections mounted on film were used for LCM. Approximately 2000 epithelial cells/sample were 
captured using the PALM Robot-Microbeam technology (P.A.L.M. Mikrolaser Technologie, Carl Zeiss, inc.Thornwood, 
NY), coupled into Zeiss Axiovert 135 microscope (Carl Zeiss Jena GmbH, Jena, Germany). The surrounding stroma 
in the normal mucosa, and the occasional intervening stromal components in cancer samples, were included. The 
captured cells were put in tubes in 100% ethanol and preserved at -80°C. 

25 

Example 2- RNA Extraction and Amplification. 

[0044] Zymo-Spin Column (Zymo Research, Orange, CA 92867) was used to extract total RNA from the LCM cap- 
tured samples. About 2 ng of total RNA was resuspended in 10 ul of water and 2 rounds of the T7 RNA polymerase 
30 based amplification were performed to yield about 50 ug of amplified RNA. 

Example 3- cDNA Microarray Hybridization and Quantitation. 

[0045] A set of cDNA microarrays consisting of approximately 20,000 human cDNA clones was used to test the 
35 samples. About 30 plant genes were also printed on the microarrays as a control for non-specific hybridization. Cy3-la- 
beled cDNA probes were synthesized from 5 ug of a RNA of the LCM captured cells. The probes were purified with 
Qiagen's N ucleotide Removal Columns and then hybridized to the microarrays for 14-16 hours. The slides were washed 
and air-dried before scanning. cDNA microarrays were scanned for cy3 fluorescence and ImaGene software (Biodis- 
covery, Los Angeles, CA) was used for quantitation. For each cDNA clone, four measurements were obtained using 
40 duplicate spots and duplicate arrays and the intensities were averaged. 

[0046] cDNAs were printed on amino silane-coated slides (Coming) with a Generation III Micro-array Spotter (Mo- 
lecular Dynamics). The cDNAs were PCR amplified, purified (Qiagen PCR purification kit), and mixed 1:1 with 10 M 
NaSCN printing buffer. Prior to hybridization micro-arrays were incubated in isopropanol at room temperature for 10 
min. The probes were incubated at 95°C for 2 min, at room temperature for 5 min, and then applied to three replicate 
45 slides. Cover slips were sealed onto the slides with DPX (Fluka) and incubated at 42°C overnight. Slides were then 
washed at 55°C for 5min in 1 X SSC/0.2% SDS and 0.1 X SSC/0.2% SDS, dipped in 0. 1X SSC and dried before being 
scanned by a Genlll Array Scanner (Molecular Dynamics). The fluorescence intensity for each spot was analyzed with 
AUTOGENE software (Biodiscovery, Los Angeles). 

[0047] Chip intensities were linearly normalized forcing the intensity reading at the 75 th percentile equivalent to a 
so value of 1 00 on each chip. Every gene on the chip was normalized to itself by dividing the intensity reading for that 
gene by the median of the gene's expression value readings over all the samples. Prior to clustering, genes that did 
not have an intensity reading of 1 00 or greater in at least one sample were filtered out in order to limit the background 
affect on the similarity metrics. A set of 6,225 genes was selected for clustering analysis. Hierarchical clustering was 
performed using correlation as ) a measure of similarity, which groups together samples with genes that are showing 
55 positive changes at the same time without any consideration for negative changes (Silicon Genetics, Sunnyville, CA). 
Each of the major nodes in the dendrogram was then considered a subgroup of samples. Differentially expressed 
genes were identified by comparing each tumor subgroup to the normal group. The selection was based on a signal 
to noise measurement threshold given by 
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where u>, is the mean of the tumor subset, u. n is the mean of the subset of normal samples, and o t + <y n represent the 
combined standard deviations. The within-group coefficient of variation of the intensity readings of a gene had to be 
less than 0.33 : for the gene to be included in the pair-wise comparisons. The median of the tumor group over the 
median of the normal group had to ) be greater than, or equal to 2 for up-regulation, and less than, or equal to 0.5 for 
down-regulation. If a gene met all the criteria, it was selected. The genes selected in all the comparisons were con- 
sidered consistently dysregulated in colorectal cancer. The p-values for the statistical significance were calculated 
using a T-test assuming unequal variance. The gene set for clustering was also subjected to principal component 
analysis (PCA) using a software package (Partek, St Louis, MO). The data was then projected onto the reduced 3-di- 
mensional space. The normal and tumor colorectal samples were represented by the projected expression levels. 
[0048] A list of genes with large up-regulated differentials was created to distinguish between the tumor and normal 
samples. One-hundred and twenty-three genes were preselected by using 



20 



0.5 < 



as a signal to noise cutoff. A ratio equal to, or greater than 1 .5 was the minimal criterion for up-regulation. Genes were 
also included if 



25 



0.9 < 



for M 



A portfolio of four genes was established, each having at least a three fold expression differential between tumor and 
normal cells. 

30 [0049] Differentially Expressed Genes in Colorectal Cancer. Thirty-nine genes were differentially expressed in all 
tumor samples as compared to normal colon mucosa. Thirty-seven of them were significantly down-regulated in all 
the tumors, except for an outlier. Two of them were up-regulated. The identities of the genes were verified by sequencing 
the cDNA clones placed on the microarray. Results are shown in Table 1 . 



35 



40 



45 



50 



Table 1 



Modulated Genes 


ACCESSION 


GENE 

DESCRIPTION 


MEAN SIGNAL 

INTENSITY 

(NORMAL) 


MEAN SIGNAL 

INTENSITY 

(TUMOR) 


P-VALUE 




AF071569 


CaM kinase II gene 
subtype delta 2. 


93 


39 


4.64E-09 


Seq. ID No. 1 


AB014530 


Homo sapiens 
mRNA for 
KIAA0630 protein 


108 


50 


4.83E-07 


Seq. ID No.2 


AK000319 


Human cDNA 
KIAA0630 


236 


69 


7.84E-06 


Seq. ID No.3 


U81504 


beta-3A-adaptin 
subunit of the AP-3 
complex mRNA, 


241 


75 


3.52E-05 


Seq. ID No.4 


AB011166 


Human cDNA 
KIAA0594 


116 


55 


3.53E-05 


Seq. ID No.5 


AB040914 


Human cDNA 
KIAA1481 


187 


59 


8.85E-05 


Seq. ID No.6 
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Table 1 (continued) 





Modulated Genes J 




ACCESSION 


GENE 


MEAN SIGNAL 


MEAN SIGNAL 


P-VALUE 




5 




DESCRIPTION 


INTENSITY 
(NORMAL) 


INTENSITY 
(TUMOR) 








AK025205 


Human cDNA 
FLJ21552 


322 


97 


0.00013 


Seq. ID No.7 


10 


AJ278219 


Fatty acid 
hydroxylase 


143 


53 


/\ AAA J J 

0.00011 


Seq. ID No. 8 




AB046854 


Human cDNA 
KIAA1634 


142 


59 


0.00020 


Seq. ID No.9 


15 


R00585 


Unknown 


149 


57 


1.28E-09 


Seq. IDNo.10 




S45844 


Spi-B transcription 
factor 


140 


43 


0.00043 


Seq. IDNo.11 




X98311 


Carcinoembryonic 


6137 


223 


0.00044 


Seq. IDNo.12 


20 




antigen family 
member 2 (CGM2^ 












BAA78050 


NADPH 
oxidoreductase 


153 


84 


0.00048 


Seq. ID No.40 


25 




homolog 










N72128 


Unknown 


164 


77 


0.00068 


Seq. ID No. 13 




AB040955 


Human cDNA 


334 


120 


0.00067 


Seq. ID No.14 


30 


AF125101 


HSPC040 protein 


363 


115 


0.0011 


Seq. ID No.15 


AB023229 


Human cDNA 
KIAA1012 


263 


88 


0.00099 


Seq. ID No.16 




N95761 


a-L-fucosidase 


429 


104 


0.00047 


Seq. ID No.17 


35 




gene 










AK025033 


Human cDNA 
FLJ21380 


180 


85 


0.0010 


Seq. ID No.18 




L10844 


Human cellular 


206 


101 


0.0013 


Seq. IDNo.19 


40 




nrn\A/th rpnnlatinn 

protein 












H96534 


H. sapiens mRNA 
for gp25L2 protein. 


147 


58 


0.0015 


Seq. ID No.20 




AK001521 


Human cDNA 


157 


60 


0.0019 


Seq. ID No.21 


45 




FLJ10659 












AF151039 


HSPCP05 n rote in 


117 


60 


0.0017 


Sea ID No 22 




AF052059 


SEL 1L protein 


168 


53 


0.0016 


Seq. ID No.23 


50 


N24597 


Unknown 


166 


62 


0.0016 


Seq. ID No.24 


AK001950 


Inner centromere 
protein 


148 


64 


0.0029 


Seq. ID No.25 




BAA02649 


Macrophage 


118 


44 


0.0031 


Seq. ID No.41 


55 




scavenger receptor 
type I 












N75004 


Unknown 


98 


48 


0.0031 


Seq. ID No.26 
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Table 1 (continued) 





I il 

Modulated Genes 


5 


ACCESSION 


GENE 

DESCRIPTION 


MEAN SIGNAL 

INTENSITY 

(NORMAL) 


MEAN SIGNAL 

INTENSITY 

(TUMOR) 


P-VALUE 






W16916 


Human cDNA 
KIAA0260 


162 


61 


0.0037 


Seq. ID No. 27 


10 


X52001 


H.sapiens 

pndothplin ^ 

mRNA. 


89 


33 


0.0042 


Seq. ID No.28 




TC07QQ 
1 JU / DO 


\J\ 1 M IUWI 1 


364 


102 


0 005Q 


Sea ID No 38 


ID 


AJ005866 


Putative Sqv-7 like 

nrntcin 


381 


163 


0.0049 


Seq. ID No.29 




AF1 13535 


MAID protein 


218 


100 


0.0053 


Seq. ID No.39 


20 


AB037789 


Human cDNA 
KIAA1368 


164 


62 


0.0068 


Seq. ID No.30 




M33987 


Carbonic 
anhydrase 


652 


46 


0.0074 


Seq. ID No.31 


25 


M77830 


Desmoplakin 1 
(DPI) 


184 


81 


0.0092 


Seq. ID No.32 




H81220 


EST domain 
tanscription factor 
ELF1 


113 


55 


0.017 


Seq. ID No.33 


30 


AF000592 


Human 
chromosome 
21q11-q21 
genomic clone 


33 


69 


1.16E-05 


Seq. ID No.35 


35 


AK021701 


Human cDNA 
FLJ11639 


31 


63 


0.00070 


Seq. ID No.36 



Example 4: Optimized Portfolio for Colorectal Tumors. 

[0050] The mean-variance optimization algorithm was used to generate a multiple genebased signature, where the 
genes that are included can be used in combination to distinguish between the normal and tumor samples. Intensity 
measurements were processed using the samples and microarrays described in Examples 1 -3. The data to be analyzed 
was first preselected based on a pre-specified 5-fold differential between tumor and normal cells. The expression data 
from genes preselected according to this criteria were then used as follows. The mean and standard deviation of the 
intensity measurements for each gene were calculated using the non-metastatic samples as the baseline. A discrim- 
inating value of X*(Standard Deviation + Mean) was then calculated for each baseline gene ( X was assigned a value 
of 3). This value was used to ensure the resulting portfolio would be stringent. A ratio of the discriminating value to the 
baseline value was then calculated for each metastatic sample. This ratio was then converted to a common logarithm. 
This data was then imported into Wagner Software which produced an efficient frontier from which a portfolio of 4 
genes was selected. The set included an unknown sequence, procollagen type I, large subunit of ribosomal protein 
L21 and fibronectin. These genes are identified as Seq. ID No 42, Seq. ID No. 43, Seq. ID No. 44 and Seq. ID No. 45. 
Alternatively, a combination of genes used to make up the portfolio can be used to produce diagnostic information that 
is useful for making clinical decisions regarding colorectal cancer. This is particularly beneficial in the case when a 
combination of genes selected from the portfolio are combined with additional markers (genetic or not). 
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Optimized Gene Portfolio: 
[0051] 

5 >gil1264443lgblN92134.1IN92134za23f09.r1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone IMAGE: 

293417 5' similar to gblM87908IHUMALNE32 Human carcinoma cell-derived Alu RNA transcript, (rRNA); gb: 
X57025_mal INSULIN-LIKE GROWTH FACTOR IA PRECURSOR (HUMAN) 

>gil2221047lgblAA490172.1lAA490172 ab06b08.s1 Stratagene fetal retina 937202 Homo sapiens cDNA clone 
IMAGE:839991 3' similar to gb:J03464 PROCOLLAGEN ALPHA 2(l) CHAIN PRECURSOR (HUMAN) 
10 >gil21 8891 8lgblAA464034. 1 1AA464034 zx86b09.r1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IM- 

AGE:810617 5' similarto SW:RL21 J-IUMAN P46778 60S RIBOSOMAL PROTEIN L21. 
>gil834491lgblR62612.1IR62612yi12d01.s1 Soares placenta Nb2HP Homo sapiens cDNA clone I MAGE: 139009 
3' similarto gb:X02761_cds1 FIBRONECTIN PRECURSOR (HUMAN); 

is [0052] Using a different set of criteria but the same method, a further four gene portfolio was selected by the software. 
These are Seq. ID no. 46, Seq. ID No. 47, Seq. ID No. 48 and Seq. ID No. 49. Two genes overlap with the first four- 
gene portfolio. The two optimized portfolios can also be combined to form a six-gene portfolio. 

Optimized Gene Portfolio: 

20 

[0053] 

>gil2114953lgblAA431245.1IAA431245 zw78d06.r1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE: 
782315 5' similar to WP:F36H1 .2 CE05814 ANKYRIN LIKE 
25 >gjl21 561 72lgblAA443497.1 IAA443497 zw34d03.r1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IM- 

AGE:771173 

>gil2221047lgbl2IAA490172 ab06b08.s1 Stratagene fetal retina 937202 Homo sapiens cDNA clone IMAGE: 
839991 3' similar to gb:J03464 PROCOLLAGEN ALPHA 2(l) CHAIN PRECURSOR (HUMAN) 
>gi!1264443lgblN92134.1IN92134za23f09.r1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA clone IMAGE: 
30 293417 5' similar to gblM87908IHUMALNE32 Human carcinoma cell-derived Alu RNA transcript, (rRNA); gb: 

X57025_rnal INSULIN-LIKE GROWTH FACTOR IA PRECURSOR (HUMAN); 
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SEQUENCE LISTING 

<U0> WANG, YIX1N 

<120> COLORECTAL CANCER DIAGNOSTICS 

<1.30> CDS 267 US NP 

<140> TBD 

<141> 2003-03-21 

<150> 60/369,790 
<151> 2002-03-29 

<160> 49 

<170> Patentln version 3.1 

<210> 1 

<211> 1500 

<212> DNA 

<213> human 

<400> 1 

atggcttcga ccaccacctg caccaggttc acggacgagt atcagctttt cgaggagctt 
60 

ggaaaggggg cattctcagt ggtgagaaga tgtatgaaaa ttcctacLgg acaaggatat 
120 

gctgccaaaa ttatcaacac caaaaagctt tctgctaggg atcatcagaa actagaaaga 
180 

gaagctagaa tctyccgtct tttgaagcac cctaatattc tgcgacttca tgatagcata 
240 

tcagaagagg gctntcacta cttggtgttt gatttagtta ctggaggtga actgtttgaa 
300 

gacatagtgg caagagaata ctacagtgaa gctgatgcca gtcattglat acagcagatt 
360 

ctagaaagtg ttaatcattg tcacctaaat ggcatagttc acagggacct gaagcctgag 
420 

aatttgcttt tagctagcaa atccaaggga gcagctgtga aattggcaga ctttggctta 
480 

gccatagaag ttcaagggga ccagcaggcg tggtttggtt ttgctggcac acctggatat 
540 

ctttctccag aagttttacg taaagatcct tatggaaagc cagtggatat gtgggcatgt 
600 

ggtgtcattc tctatattct acttgtgggg tatccaccuL tctgggatga agaccaacac 
660 

agactctatc agcagatcaa ggctggagct tatgattttc catcaccaga atgggacacg 
720 

gtgactcclg aagccaaaga cctcatcaat aaaatcctta ctatcaaccc tgccaaacgc 
780 

atcacagcct cagaggcact gaagcaccca tggatctgtc aacgttictac tgttgcttcc 
840 
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atgatgoaca gacaggagac tgtagactgc ttgaagaaat ttaatgctag aagaaaacta 
900 

aagggtgcca tcttgacaac tatgctggct acaaggaatt tctcagcagc caagagtttg 
960 

ttgaagaaac cagatggagt aaaggagtca actgayagtt caaatacaac aattgaggat 
1020 

gaagatgtga aagcacgaaa gcaagagatt atcaaagtca ctgaacaact gatcgaagct 
1080 

atcaacaatg gggactttga agcctacaca oaaatctgtg acccaggcct tactgctttt 
1140 

gaacctgaag ctttgggtaa tttagtggaa gggatggatt ttcaccgatt ctactttgaa 
1200 

aatgctttgt ccaaaagcaa taaaccaatc cacactatta ttctaaaccc fccatgtacat 
1260 

ctggtagggg atgatgccgc ctgcatagca tatattaggc tcacacagta catggatggc 
1320 

agtggaatgc caaagacaat gcagtcagaa gagactcgtg tgtggcaccg ccgggatgga 
1380 

aagtggcaga atgttcattt tcatcgctcg gggtcaccaa cagtacccal caagccaccc 
1440 

tgtat tccaa atgggaaaga aaacttctca ggaggcacct ctt tgtggca aaacatctga 
1500 



<210> 2 

<2ll> 5761 

<212> DNA 

<213> human 

<400> 2 

cacaccgcag tatgcggtgc cctttactct gagctgcgca gccggccgqc cggcgctggt 
60 

tgaacagact gccgctglac tggcgtggcc tggagggact cagcaaairtc tcctgccttc 
120 

aacttggcaa cagttgcctg gggtagctct acacaactct gtccagccca cagcaatgat 
180 

tccagaggcc atggggagtg gacagcagct agctgactgg aggaatgccc actctcatgg 
240 

caaccagtac agcactatca tgcagcagcc atccttgctg actaaccatg tqacattggc 
300 

cactgctcag cctctgaatg ttggtgttgc ccatgttgtc agacaacaac aatccagttc 
360 

cctcccttcg aagaagaata agcagtcagc tccagtctct trcangfcct ctctagatgt 
4 20 

tctgccttcc caagtctatt ctctggttgg gagcagtccc ctccgcacca catcttctta 
480 

taattccttg gtccctgtcc aagatcagca tcagcccatc atcattccaq atactcccag 
54 0 
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ccctcctgtg agtgtcatca 
600 

caagcccogt agctctggac 
660 

tgattctcca gactctgact 
720 

tctccgaggc aatagtggat 
780 

tggcacccgc actatcattg 
840 

aacccaggcc tcaggtctcc 
900 

gtcatctgga tgctgtatca 
960 

agcacaacca ctcaatctta 
1020 



gagaagcagc aacccagccc 
1080 

cccctacacc ttccagcatg 
1140 

ggcccctgct cacctgccaa 
1200 

tgctgcactg ggctcaacca 
1260 



gcatgctgca gcctatacca 
1320 

tgggcccagc ctcctcactt 
1380 

caccoaatcc tacattggct 
1440 



tcctaccaag atcagccagt 
1500 

atggctacct tctcctggcc 
1560 



ccctcttgaa atttcttagc 
1620 



tttctctggg ggaacctgtc 
1680 



cctattttta aattcattat 
1740 



cccatcttct gcagttacca 
1800 

tttgtctctc tgacttgatt 
1860 



ctatccgaag tgacacLgat 
tgaagccaag gtctaatgtc 
cttctttgag cagcccttat 
ccgttttgga ggggcctggc 
tgcctccact gaaaactcag 
tgagcaataa gactaagcca 
cccccacagg gtatcgagct 
gccagaacca gcagtcatcg 
cccgcaggca gcaggcgttt 
gcagcccgct acactcgaca 
gccaggctca tctgtatacg 
gctccattgc tcatcttttc 
ctcaccctag cactttggtg 
ctgccagcgt ggcccctgct 
cfctcccgagg ctcaacaatt 
attcctactt atagttggtg 
ctgcgttctt aatattgggc 
cagcaacttg ttctgcaggg 
tcagtgttya ctycattytt 
ttttgt-.gaca gtaattttgg 
aggaagagag attgttctga 
tctataaatg cttttaaaaa 



gaggaagagg acaacaaata 
atcagttatg tcactgtcoa 
tccactgata ccctgagtgc 
agagttgtgg cagatggcac 
cttggtgact gcactgtagc 
gtcgcttcag tgagtgggca 
caacgcgggg ggaccagtgc 
gcggctccaa cctcacagga 
gtggcccctc tctcccaagc 
gggcacccac accttgcccc 
tatgctgccc cgacttctgc 
tccccacagg gttcctcaag 
caccaggtcc ctgtcagtgt 
cagtaccaac accagtttgc 
tacactggat acccgctgag 
agcatgaggg aggaggaatc 
tatggagaga tcctccttta 
gcccactgaa gcagaaggtt 
gtaylcttcc caaagtttgc 
tacttggaag agttcagatg 
agttaccctc tgaaaaatat 
caagtgaagc ccctcttcat 
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ttcattttgt gttattgtga 
1920 

tgatgacaaa aaaagaaaaa 
1980 

attttaaaag cggcttacac 
2040 

ttcagttttg ttttaatgtc 
2100 

gttacgtatt actctgtgtt 
2160 

agtagtgttt aaaaggcagc 
2220 

tgcgtgaaaa caccaagtat 
2280 

tttttaaaag tctttccctc 
2340 

ggtggttatt attacatggt 
2400 

agatactggc attgatgagc 
2460 

tgttttgctc atctctcccL 
2520 

cctgaaacca gataagaaca 
2580 

tgtatgccag cagcaaattg 
2640 

gaattgcaaa aaatatttta 
2700 

taatggtggt gtuttaatat 
2760 

aacaagcaat ttttcctgct 
2820 

acttgaattg tgtacttagt 
2880 

tgtctccatt gtatttaaac 
2940 

tgcaattata ttagagcata 
3000 

aaaggagacc cttggaattg 
3060 

gtctcttcct tccctttctt 
3120 

ggttaataga gtttacaqtq 
3180 
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ttgctggtca ggaaaaatgc 
ttactttttg tttgtttata 
aatctccctt ttglttattg 
atattatact taatgggcaa 
actattgaga ttctctcaat 
tcaccatttg ctggtaactt 
tctttttaaa tgaagcacca 
tctgattcag cttaaatttt 
ggtggtggtt ttattatatg 
tttgcctaaa gattagtatg 
tctgttttat gtgatttgtt 
tttcttgtgt atagctttta 
aatgctctct tattaagact 
aaaatttalt actgaattta 
tttacataat taaatatgta 
aacccaaaat gttatttgta 
gtgtatgtga tcctccagtg 
caaaatgaac tgatacttgt 
ttactgtagt gctgaatgag 
ttttgcacag gtgtgtctgg 
cctccttccc Llattgtagt 
agcttgcctt aggatggacc 



tgatagaagg agtlgaaatc 
aactcagact tgcctatttt 
gacatttaaa cttacagagt 
ttgttatttt tgcaaaactg 
tgctcctgtg tttgttataa 
aatgtgagag aatccatatc 
tgaattcttt tttaaattat 
tttatcgaaa aagccattaa 
caaaaCctct gtctattatg 
aattttcagt aatacacctc 
tggggagaaa gctaaaaaaa 
tacttcaaag tagcltcctt 
tatataataa gtgcat.gtag 
aaaatatttt agaagttttg 
catattgatt agaaaaatat 
atcaaatgtg tagtgattac 
ttatcccgga gatggattga 
tggaatgtat gtgaactaat 
caggggcatt gcctgcaagg 
tgaggagttt ttcagtgtgt 
gccttatatg ataatgtagt 
agcaagcccc cgtggaccct 
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aagttgttca ccgggattta tcagaacagg 
3240 



ctcagtttcc ctgccaacat tgaaaaataa 
3300 

tctacccctt tccattttgg attcLcggcL 
3360 



gctctctcac tgtgcattcc taccttgctt 
3420 



gtcaagccaa tattaaatat gcattctttt 
34 80 

ttttttttcc ttttcccatg tggcagtcct 
3540 

tatttgcttg ttgaaaaaaa catgttaaca 
3600 



tgcttaccat gtccccatac tatgaggaga 
3660 



tcacagaaag gtttcttagc tggtgaagaa 
3720 

altgaggctt t tgaggtttc ttttttaaca 
3780 

tgaaattgtc cttgtactct cagctcctgc 
3840 



gatggggaca ttcctgccca taaaggattt 
3900 

tgtgttccat ccgaattgaa aatgatatat 
3960 



tagatagaga tggtgtcaag gaggtgcagg 
4020 

cagccagctc tgtaccaggt tgaacaccga 
4080 



ttgtaaggag taagggcttc caagatgggg 
4140 



gttgtgtttt ctttattttt taaaatcatt 
4200 

gtcaagatag ccaagcagtt tgtataattt 
4260 

aacatgtgtg atctttgtgt ctcctttttg 
4 320 



acaggtctag tttclaaagg acaaattttt 
4 300 

gatttgttgt ttttgtaaga aatgagatgc 
4440 

cccagtccaa taagcagata ccacttaaga 
4500 



attagtagct gtattgtgto atgcattgtU 
aaacagcagc ttttctcctt taccaccacc 
gagttctcac agaagcattt tccccatgtg 
ctgtgagaat tcaggaagca ggtgagagga 
aaagtatgtg caaccacttt tagaatgaat 
tcctgcacat agttqacatt cctagtaaaa 
gatgtgttta taccaaagag cctgttglat 
agttttgtgg tgccgctggt gacaaggaac 
tatagagaag gaaccaaagc ctgttgagtc 
guttgtatag tcttggggcc cttcaagctg 
atggatctgg gtcaagtaga aggtactggg 
ggggaaagaa gattaatcct aaaatacagg 
ttgagatata attttaggac tggttctgtg 
aUggagatgg gagatttcat ggagcctggt 
ggagctgtca aagtatttgg agtttcttca 
caggtagtcc gtacagccta ccaggaacat 
atattgagtt gtgttttcag cactatattg 
ctgtcactag tgt.catacag ttttctggtc 
ccaagcacat tctgatlttc ttgttggaac 
tgttccttgt cttttttctg taagggacaa 
aggaaagaaa accaaatccc attcctgcac 
taggaqtcta aactccacag aaaaggataa 



17 



EP1 355149 A2 



taccaagagc ttgtattgtt accttagtca 
4 560 



agagattttt cagtcttagt ctgcaaactg 
4620 



cctgtgtctg ctgaatgtgt atgtatgtgc 
4680 



ttagccctgt tggccctgac aggaagggag 
4740 



ctgggtcaca gtgacctgac ctcaaaccag 
4800 



ggcatttcca acttcttcct ttccgggtga 
4860 

cactctgggc tcatagggac acttggtcac 
4920 

atattatttt cagtgctcag ctgaaatacc 
4980 

agttctggcc attctgagcc tgcttttgtg 
5040 



gctaagcttg actgccctta gccaggcaag 
5100 

tatgcaaaaa ttcactagtt gagalggttt 
5160 



agtgacagga gtggcccgag cctgcttcct 
5220 



atggtgcagc atgtctacat ggttgtttgt 
5280 



ttcagcttga aaaataatct cactacatgt 
5340 

tgttagtatt tctgctttga atccttgata 
5400 

atttgatatg ctagttattg tgtgcgattt 
5460 



ttgtgcgctt tcttttacaa caagcctcta 
5520 



tatgtttgta atgcagatgt acttaggqag 
5580 



tagatattta aaatttaata ctaactatgg 
5640 

atctttggat tcaatgtttg tctttggttt 
5700 



tacataatat ggtaaaatgt agagcaattg 
5760 



g 

761 



cttgcctagc agtgtgtggc tttaaaaact 
gcatttccga ttttccagca taaaaatcca 
tcactgtggc tttagattct gtccctgggg 
gaagcctggt gaatttagtg agcagctggc 
cttaaggctt taagtcctct ctcagaactt 
gagaagaagc ggagaagggt tcagtgtagc 
tccagagttt ttaatagctc ccaggaggtg 
aaccccagga ataagaactc catttcaaac 
attgctcatc cattgtcctc cactagaggg 
cacagtaatg tgtgttttgt tcagcattat 
gttttaggat aggaaatgaa attgcctctc 
attttgattt tttttttttt taactgatag 
tgctaaactt tatataatgt gtggtttcaa 
agcagracat tatatgtaca ttatatgtaa 
ttgcaatgga attcctactt tattaaatgt 
aaactttttt tgctttcf.cn ctttttttgg 
gaaacagata gtttctgaga attactgagc 
tatgtaaaat aatcatttta acaaaagaaa 
gaaaagggtc cattgtgtaa aacatagttt 
tacaaagtag cttgtatttt cagtatLttc 
caatgcatca ataaaatggg taaattttct 



18 



EP 1 355 149 A2 



<210> 3 

<211> 2129 

<212> DNA 

<213> human 

<400> 3 

ctgtattgag acaaaggaag ggatctgtca gaaagcaaca cttgttatct tgggcttggc 
60 

agcaaggaag aggacaggta gtggagatcc tgcaatctga aaagcagact gaaaggtgac 
120 

aaagaogctg aagatgggtg gtggagagag gtataacatt ccagcccctc aatctagaaa 
180 

tgttagtaag aaccaacaac agcttaacag acagaagacc aaggaacaga attcccagat 
240 

gaagattgtt cataagaaaa aagaaagagg acatggttat aactcatcag cagctgcctg 
300 

gcaggccatg caaaatgggg ggaagaacaa aaattttcca aataatcaaa gttggaattc 
360 

tagcttatca ggtcccaggt tactttttaa atct.caagct aatcagaact atgctggtgc 
420 

caaatttagt gagccgccat caccaagtgt tcttcccaaa ccaccaagcc actgggtccc 
4 80 

tgtttccttt aatccttcag ataaggaaat aatgacattt caacttaaaa ccttacttaa 
540 

agtacaggta taaaataaga caaatgttta aatttagtta tgttcacggg tagttgtcaa 
600 

ttggtctgaa acaaatttgc tagggaatct atttgtgtag aactaattaa tgtaaaaaaa 
660 

atagaccatc tcgtgttgtg tgcactgtga tataatggta gtatcagtgc aacttaaact 
720 

aatgattgta cttgatatta agtgttctca actgagtaac tlLtaagtgg aaaccaagtt 
780 

tagatttggg gagtggtaaa ggaatcagct ttttctattg ttaggggaag acagtaattt 
840 

atcattcatg gaccagtaga ttgttgaaag ttggtgaatc ggattataag cttctagcta 
900 

acacaaggat tcagaattag gtaaacatct gaaggtttag tatattagaa acacccaaac 
960 

cagtaatatg ctaacctgat gcactgctga aagaaaatgt gaatttttcg taataattgc 
1020 

attttagtga attgtacagt gggtggaaag ggcatttgga gutcatztaga atgagacata 
1080 

gtacacccca atggccctgt ttattaaatg tagtggatta agtgtctgtc aacaaataca 
1140 

ccaaaaccat tttttataga aacagtattt aatggtcact caatagcttt caaaatacat 
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1200 

ttttgtatta cagcactgca caagctattc taatagtgct ctgqcctcat cattcctcca 
1260 

aacjcllgctt tggggagttg gataalgtga aaattttaag tacctagggg agaaagagcc 
1320 

atgtaaatat ctgtaataaa cttgtagcat atgtaaagtt ttcttggcct ttatcttaca 
1380 

aaaatggagt attttagtat gaatttgctg aatgtaogac cgtggactgt tttttataat 
1440 

atggcctaat tttaaaggtc caaaataact tgtttttaaa gtttgccctt gtgctaaagt 
1500 

gccagtgtat gtatgttata cttgatttgg t tgtaaacta tatt tcaaag taaaccctag 
1560 

tgtaataagt tttataacta aaaaggttta agctgctaaa actattttta agagatgtga 
1620 



aatqcagtat gggactatct ttttttcctc ctctaaqccc aaagattaac taqagtccct 
1680 

ccaacettat agattgttgg ctttcacaat cttataacct aggatacagg Lagtttcgag 
1740 

tatggtgcca gtgatgtttt gtttttgttt ggtcaagggg taggtgcaac ocaatggacc 
1800 

acttatgcaa aagatgtaaa ctcttgcata atacattgat aacatgtttt gccaacttta 
1860 

aatgcttaaa cataagcgaa accagtagca agtatgtggg tcaqcttaaa aattttgar.t 
1920 

gttaaLgccc tattttctaa L ttggcacct ettgatgcct aaycaggtaa gcagatgcct 
1980 

aagctgtatt tctccaaata aatcaagatg aagtactgcc caagttaaat attgatagcc 
2040 



taaagacaag tttatgtagt acttaatgta catgatatga agcataaaat taaataaaat 
2100 

ttttccccat tgaaaaaaaa aaaaaaaaa 
2129 



<210> 4 

<211> 3950 

<212> DNA 

<213> human 

<400> 4 

cgagaactag t tttgfctccg tgccctctgg actggaacct tttggagaga acccccggca 
60 

ggaccaaccc cgcacccgcc agcaccgcgg caatgtccag caatagtttt: ccttacaatg 
120 



agcactccgg aggaggggag gcgacggagc tgggtcagga ggcgacctca accatttccc 
180 



20 



cctcggqggc cttcggcctc 
240 

tgllagagag caacaaagat 
300 

ttgcaaaagg gaaaaatgca 
360 

aaaatattga gatcaagaag 
420 

aggatcttgc actcctgtcc 
480 

taattcgtgc aagcgctttg 
540 

tcatgatgct tgctattaag 
600 

cagcccatgc aatacaaaaa 
660 

aagtaattga aaaacttctg 
720 



cttttgaaga agtatgcccg 
780 

gtaacttact agtggatgtt 
840 

gatatgctcg gacacagttt 
900 

gaaagaattt ctacgaatct 
960 

cgtatactat ggatccagat 
1020 



gcaggaatgc tgcggtggut 
1080 

ctgaagctgg cataatttct 
1140 

agtatattgt cct.acaaaat 
1200 

cttatctgaa gagtttctat 
1260 



ttgaaatttt gacaaacttg 
1320 

agacctatgt gaaaagccag 
1380 

gaLgtgcaac caacatcttg 
1440 



tgtccaacag ggatgaaata 
1500 
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Cttagcagcg atttgaagaa 
tctgctaaac tggatgctat 
tctgaactgt ttcctgctgt 
ttggtatatg tttacctggt 
ataagcactt ttcagcgagc 
agagttctgt caagtattag 
gaagcttctg ctgaeltatc 
ttatacagcc ttgatccaga 
aaagataaaa gcacattggt 
gacagaatag atctgattca 
gaagagtggg ggcaggttgt 
gtcagccctt ggaaagaggg 
gatgatgatc agaaggaaaa 
catagactct taattagaaa 
atggcagttg ctcagctgta 
aaatcactag tgcgtttact 
atagcaacta tgtcaattca 
gttaggtcaa ctgatccaac 
gcaaatgaag ccaacatalc 
gataaacaat ttgcagcagc 
gaagtcactg acacgtgcct 
gttgttgctg aaagtgtggt 



gaatgaagat claaagcaaa 
gaagcggatt gttgggatga 
tgtgaagaat gtggccagta 
tcgatatgct gaagaacagc 
tctgaaggac ccaaaccaac 
agtgccaatt attgtaccta 
accatatgtt aggaagaatg 
gcagaaggaa atgttaattg 
agctgqcagt gttgtgatgy 
taaaaattac cgcaagctat 
cataatccac atgctaactc 
tgatgaatta gaagacaatg 
gactgacaaa aagaagaagc 
tacaaagcct ttgcttcaga 
ttggcacata tcaccaaaat 
tcgtagcaat agggaggtgc 
aagaaagggq atgtttgaac 
tatgatcaag acactgaagc 
aactcttctt cgagaatttc 
cactattcag actataggca 
caatggcttg gtctgtctgc 
tgttataaag aaattactgc 



21 



aaatgcaacc tgcacaacat 
1560 

tcacligttcc tgttgctaga 
1620 

ttcctaaaat tqcccctgat 
1680 

atctggtaaa actqcagata 
1740 

agacaaaatt gcttacccag 
1800 

tcagagaccg tacaagattt 
1860 

taagtaaata tgccaaaaaa 
1920 

cttttaaaga tagagatcat 
1980 

ctactgggta cctggaatta 
2040 

atgtagaagt aatagagttg 
2100 

attctgctaa gaagttttat 
2160 

gtgacagtga gaytyaaLct 
2220 

gagacagcaa tgaggacagc 
2280 

gggagtcagg cctagaaaac 
2340 



gtgattctga agatggggag 
2400 

acgaatctag ttcaatagaa 
2460 

gtgaatctga atccagaaga 
2520 

ctcctcttac caaagatgtt 
2580 

caottgcact tcccacacca 
2640 

acttgtcaac ttcctcttca 
2700 

ctcacgtgct gcttcatcga 
27 60 

gacagccttg catttttggt 
2820 
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ygtyaaatta LLaaacatat 
gcaagtattc tttggctaat 
gttttgagga agatggctaa 
ttaaatctgg gagcaaaatt 
tacatattaa atctcggcaa 
attaggcagc ttattgttcc 
atattcctag cacaaaaycc 
ttccagcttg gcaccttatc 
tctaattggc cagaggtggc 
gcaaaagaat ggaccccagc 
tctgaatctg aggaagagga 
ggaagtgaaa gtggagaaca 
agtgaggact cctccagtga 
aaaagaacag ccaagaggaa 
aaggaaaatg aaaaatctaa 
gacagttctt ccgattctga 
gtcactaagg agaaagaaaa 
tcacttctag atctggatga 
gctctttctc caagtttgat 
gtcatcagtg Lcagtactcc 
atgagtggaa aaggactagc 
gataagatgg tctctataca 



22 



ggccaaactc ctggacagta 
tggagaaaac tgtgaacgag 
aagcttcact agtgaagatg 
gtntttaacc aactccaaac 
gtatgatcaa aactacgaca 
gaatgaaaag agtggagctt 
tgcaccactg cttgagtctc 
tcatactctc aacattaaag 
gcccgaccca tcagttcgaa 
aggaaaagca aagcaagaga 
ggactcttct gatagtagca 
aggcgaaagt ggggaggaag 
gcaggacagt gagagtggac 
ctcaaaagcc aaaggaaaaa 
aacttcagat tcttcaaatg 
atcagagtca gaacctgaaa 
gaaaacaaag caagatagaa 
ttttaaccca gtatccactc 
ggclgatctt gaaggtttac 
tgcatttgta ccaacgaaaa 
tgcccattat ttctttccaa 
aataacactg aataacacta 
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ctgatcgaaa gatagaaaat atccacatag gggaaaaaaa acttcctata ggcatgaaaa 
2880 

tgcatgtttt taatccaata gactctcttg agcctgaggg atccattaca gtttcaatgg 
2940 

gtattgactt ttgtyaltct actcagactg ccagtttcca gttgtgtacc aaggatgatt 
3000 

gcttcaatgt fcaatattcag ccacctgttg gagaactgct uttacctgtg gccatgtcag 
3060 

agaaagattt taagaaagag caaggagtgc taacaggaat gaatgaaact tctgctgtaa 
3120 

tcattgctgc accacagaat ttcactccct ctgtgatcfct tcagaaggtt gtaaatgtag 
3180 

ccaatglagg tgcagtccct tctggccagg ataatataca caggtttgca gctaaaactg 
3240 

tgcacagtgg gtcattgatg ctagtcacag tggaactgaa ggaaggctct acagcccagc 
3300 

ttatcataaa cactgagaaa actgtgattg gctctgttcL gctgcgggaa ctgaagcctg 
3360 

tcctgtctca ggggtaacct gcttacatct ggactttaga atctggcaca caacaaaagt 
34 20 

gcctggcatc cactactgct gcctttcatt tataataata gcccttccat ctggcagtgg 
34 80 

gggtagaata cactcttgac attcttgtct cctgctttag aatgctagtg tgtatctatc 
3540 

atgtatgcaa tactttcccc ctttttgctt tgctaaccga agagcatata ttttactgtc 
3600 

agttgtctca actcttgaat ccatgtggcg ttttctctgt cctgctgctt cttttggcct 
3660 

cctcgttttc cttctctttt tcgacaatgg tagacatgaa tgagatattt aaagttcatt 
3720 

ggaaatcttc ttccctacag cagtaagcaa aaattagcaa agagatagtc taaatggcct 
3780 

ctcagcttgg tatgtgaaaa tcagatcaca tactttttaa atccaaatac aaaagcatag 
3840 

tctctgcaag attttgttct ttgaatttct tgatattgta attgattatt gataactgtc 
3900 

atcatgaaat tatctctcaa taataagata aataaactag catatgaatc 
3950 



<210> 5 

<211> 5191 

<212> DNA 

<213> human 

<400> 5 

gagaaagaaa aacagctcga g.acctcatgc aaagagaaaa ctgagtatcfc acagaaaatg 
60 
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gttcagagga atgaaagata taaacaagat 
120 

ttagatttaa ttgagatgct tgaagcaaaa 
180 

caggaatatg aagaagtaaa actagttcgt 
240 

aaagaagggc agattcctat aacatgtcga 
300 

ttggaggctc qaatcaaaga aaaggcaaca 
360 

cagaagcaag atgttataga aaggaaagat 
420 

atagtaaagc aaaatgaaga gcttgaccga 
480 

atagaggatt tgcaaaatga actaaagacc 
540 

attgatgcca ttacaaatga ictgagacgg 
600 

yaaataattg ataagcgaag agagagggaa 
660 

gatcatattg tacgttttga caatcttatg 
720 

ttccgtgaca cgtatgatqc tgttttatgg 
780 

agagtctgtg agcccataat gctcacgatr. 
840 

attgaaaatc atattccatc aaatgactta 
900 

atggaggttt tcctcaaaga ggttcgtgac 
9G0 

gctcccaaga gttcatatgc agacaaagca 
1020 

tacggatttt tcLct tattt gagagaatta 
1080 

ctttgctgtc agtatcatat tcatgaagtt 
1140 

attgaacggg taatacaaga aacccgatta 
1200 

gtggtgaaaa cttcttttta ttcaaacaaa 
1260 

gcccagtttc tcactgtcac tgtggaccta 
1320 

aaggnaattc atagaaaatt gcaagcagtg 
1380 



gtggagaggt tctatgaacg gaagcgacat 
aggocatggg tggaatatga aaatgttcgt 
gaccgagtga aggaagaggt cagaaaactt 
attgaagaaa tggaaaacga gcgtcacaat 
gatattaagg aggcatctca aaaatgcaaa 
aaacatattg aggaacttca gcaggcttta 
cagaggagaa taggtaatac ccgcaaaatg 
acggaaaact gcgagaatct tcagccccag 
attcaggatg aaaaggcatt atgtgaaggc 
actctagaga aggagaaaaa gagtgtggac 
aatcagaagg aagataagct aagacagaga 
ctaagaaata acagagacaa atttaaacaa 
aatatgaaag ataataaaaa tgccaaatat 
agagcctttg tatttgaaag tcaagaagat 
aataaaaaat taagagtaaa tgctgttatt 
ccttcaagat ctttgaatga acttaaacaa 
tttgatgcac ctgatcctgt aatgagttac 
cctgtaggaa ctgaaaagac cagagaaaga 
aaacagattt atacagcaga agaaaagtat 
gttatttcta gtaacacatc tctaaaagta 
gagcagagaa gacacttaga agaacagcta 
gattcagggt tgattgcctt acgtgaaaca 



24 
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agcaaacatc tggagcacaa 
1440 

agaaaaacca aqaaaagaca 
1500 

ctgatggaac aggatacttg 
1560 

aaagaaataa atgttcaaaa 
1620 

tgtacttcut tgcatataca 
1680 

gagaagaaca aattagaatc 
i740 

caacatttca ttgaattgga 
1800 

atgaaaagag ctaggcaagt 
1860 

cagacacaag tacccaccat 
1920 

gaccttccaa acacaLtgga 
1980 

tcctgcttca cgggactgaa 
2040 

atagaacagt taactgagga 
2100 

aacatttcac aqgtaaaaga 
2160 

aatgaaaaaL tcagcaattt 
2220 

acagaaaatg aggaagatta 
2280 

agtactcaac tgcatgaatt 
2340 

accatgttat acttgatggc 
2400 

gaaatcaatc agggaatgga 
2460 

actgcctgta aagaaaatac 
2520 

cttccttatt ctgaaaagat 
2580 

ccaaacacat ggaatttaaa 
2640 

ccttcLUaat aaaagtaaag 
2700 



agacaatgaa cttagacaaa 
actggaacaa aaaatcagtt 
caatcttgaa gaggaagagc 
agcgaaactt gttaccgaat 
aaaagtagat ttaattctcc 
agalLalalg gccgcatctt 
tgaaaataga cagagattat 
atgtaacctg ggtgcagagc 
tccaaatgga cacaactcct 
tgaaattgat gctttat taa 
tcctacaatt gttcaggaat 
actaaaggga aagaaagttg 
aaggtggctt aatccLLtaa 
ttttagttcc atgcagtgtg 
tgataaatat ggaattcgaa 
aactcctcat catcaaagtq 
acttcaggag ctaaatagat 
cccaatcaat gaacggagag 
atctcaatac tttttcataa 
gacagttttg tttgtctaca 
ggctttccaa aggcggcggc 
agagggaact tgggaatttt 



agaagaagga gcttcttgag 
ccaaacf.agg aagtt.taaag 
gaaaagcaag taccaaaatc 
taacaaacct aataaagatt 
aaaatacLac agtgatctct 
cacaactccg tcttacagag 
tgcagaaatg caaggaactt 
aqactcttcc tcaagaatac 
cactccccat ggttttccaa 
ctgaagaaag atcaagagct 
atacaaaaag agaagaagaa 
aactagatca atacagggaa 
aagagctggt agaaaaaatt 
ctggtgaagt tgatctccat 
ttagagtcaa atttcgaagt 
gaggtgaaag aagtgtttct 
gtccattcag agtagttgat 
tgtttgaaat ggttgtaaat 
caccaaagct cctgcaaaat 
atggccctca tatgctggaa 
gccgtattac attcactcaa 
ttttgttaaa ttctgtttat 
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aagtatggct caactgaata aaaggagatt cactaaaacg aaaagcagtt atttttggaa 
27 60 

5 acctgctttt aaatacaaat aggttgataa tggaaactat aatgaccttt ccaaaatagc 

2820 

agctggtaqt aaaagttaag tcttcttcag tcttggttga acttgagttc ttggcactct 
2880 

10 gaccatgagt cattcagttc tcatgttaaa atgtacttaa tattacaaat caaaggtaca 

2940 

gtggaagaag ggttaatcac aagaagttac ttatatggta gccctgagct ttaattgcag 
3000 

15 agtaacttta attactttta gagcctaaag atgactctag agcctaagtc ctagtttctc 

3060 

ccaatgttat atttaatttt aaaaaattga tatgaaaatg tctaatgtat agtaataatt 
3120 

20 tatgacagat ctagtcattt cttcctAtta aaaaagatta ccttatctcc agtaggaaat 

3180 

ggaattttat gggcctttaa aagaaagttt tatgaaactt gatgctataa ttttattggt 
3240 

25 atttcaaggg gaaaaaagca ctggggttca aaaatggtag cagaactgct ttgaaatgct 

3300 

gcaaggtggc cactagatga tgcaaaatac aaccaaaaga ttgactgaga ataaaattag 
3360 

30 gtgacaaggg tttttaaaga ataacctttt aaagtgtggg ggcaggggtt gctttttttt 

3420 

attttattta aagtcaatta tattttacat cttacatttc taaaagcatt ttataattat 
3400 

35 ttttagtaag atttttctta aaatttcata tactggtttc tacaatttaL aLttgaaatt 

354 0 

IcLcagtgtt atgtaaagag tgatggaaaa gcattgattt ctttaaaacc gtaatgtttt 
3600 

40 tagaacttaa gcctataggg cctttcttac aatgttgatg tacccattat cttagaoaat 

3660 

ctagtttaaa ctgttttctt tcaccgcaaa agaattaaat gggaaaatca tttgtttatc 
3720 

45 tctaagttat actaattagt agaaccaaac aaattatctt cttttaaaaa ataaatctta 

3780 

taggaaaar.a gacagtccaa agtcatgtct ttgaacagtg gattggatct gtgccagtaa 
384 0 

50 tgacaaaatt attt.ttt.tga cttgcttgcc tgaataaatt gaagaattgc tttcagtttg 

3900 

ggttttgtat attcttaagt agccattgaa atttatattc ttaactaggt caaaaaataa 
3960 

55 tgagccataa gtttatgtcc tctcacttag acattttctc tttaaaaagg tattttcttc 

4020 
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tttataaaca ttttaaaaga gccttccctt cttaaactaa ctccagtgca tgaagtgtga 
4080 

5 aaatatttta aaatgacatt tttactaata tgagcaagtx atgtaaacat tgaagaactt 

4140 

ggtaacatat tagtaaatgg atattaccaa atgttttcat cgttaattac tttqcgttcc 
4200 

10 accaaaatat ctttactaaa atgtgcttgg tgtagtttgt ttattgtcta aattagtacc 

4260 

agtcatctta tttctgcaaa atgagtatca atgtgaaaaa gacacgtgaa gattaagcat 
4320 

15 gtttgaaaat aaaatggtca attacatttc aatttacata ggccaacaac tgttccatac 

4 380 

tttgtttgta aacatttaat ttctctactq gacaaaatta atatttggct ttacattgaa 
4440 

20 ttttgagctg tgaagaataa attatgtatc attttagcat attaaacagt agtaagtcta 

4500 

gcacatagtc tcagccactt aaaacaaaag tttttttgtt tgtttgtttg tttgtttttt 
4560 

25 tgagatggag tctcactctg ttgcccaggc tggagtgcag tggcgtgatc tcggcttact 

4620 

gcaacctccg cctcccgggt tcaagcgatt ctcctgcctc agcctcccaa gtaactggga 
4680 

30 caacaggcgc gtcccaccac acccagctaa ttttttatac ttttagtaga gatggggttt 

4740 

cagcatattg gccaggctgg tctcgaactc ctgaccttgt gatccacccg cctcggcctc 
4800 

33 ccaaagtgct gggattatag gcgtgagccc ctgcacccgg ccaaaagttg atttttaatt 

4860 

acataaaaat cgtaaaaact tctagtaaaa acttgatttg gtgaatacag ttatatttta 
4 920 

40 aaaccttaag gtgacaagca ttttctatgc ctaaatcttc attggtttgc ctggaaagag 

4 980 

tctctgttaa aagattttcc atattcaaag taaaaggaaa gatttcttgc ttcctaattg 
5040 

45 tcttttggac acatgcctat tttctttgag gtataaacct ttagatgtga aaaatgtaat 

5100 

ttcattctqc tattgtgtgt gcttgtgtqt gtgtaattga aaaaactggg aaatcctgct 
5160 

50 ttgttggtaa taaatcaata tttttatatt c 

5191 



<210> 6 

<211> 4755 

<^12> DNA 

<213> human 
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<400> 6 

aagagatctt ccaggctctc 
60 

aacctctgga ggaggctgga 
120 

aagaccaagt cagccttctc 
180 

agcctggagc taggccgggg 
240 



gtcaacacca aggcagaaga 
300 

gaccggcagg ttLcclaccc 
360 

agcacagacc caagtcccga 
420 

ggccggaggg ggcccggccc 
480 

ccccactgct cggtgctgga 
540 

agaccgagtg tgggcggctc 
600 



acttccagta cttctgggaa 
660 

tcagctgaac ccctaggcaa 
720 



gaggcttccc ggcagccctg 
780 



ggcccccaga ggccggacgc 
840 



gagccagaga gggagcccga 
900 



gatgccccct tcagccgcgc 
960 



ggggccacct cctttcgacg 
1020 



tggcggccac ggccttcctc 
1080 



gcctccccgc acacgccccg 
1140 

aggcccgtgc cccctgccgc 
1200 

aagcgctccl actcggagcc 
1260 

ccggcacccc tgggcccgca 



agagccctgg gagggcgatt 
gagayaaygc ctaggccaga 
atctctccag aacattcctg 
aacccaggag ggttaccccg 
ccctgggagg aaagccgctc 
gcggcccgag gggaggaccg 
agagccgcct gccccctcgc 
aggcagcgcc tcggctcttc 
gaaggtctcc aaatlcgagc 
tggttttggc cataactata 
tgacttcgag gagacaaaag 
cggggagcag cacttcaaaa 
cyglcagcag ctgagcggag 
tcggctcctc cgtagccaga 
gtggcgggac aggcccggct 
ctaccggaac agcatcaagg 
tcgagacctg gagctggggg 
ggcccacgtg gggctgcgga 
ggagcggcac agcgtgaccc 
ccggagaggt gctcgccggc 
cgagaagatg aacgaggtgg 
gagaaatggg atgcgtttcc 



tccaggaaga ccacaatgcc 
gcctgtcagg caactttggc 
agagtctgag aagacacagc 
ggggcaggcc cacctgtgca 
ctgacctcgg gagccaf.ctg 
gtgcctcggc ttctttcaac 
acccgcacac atccagtctg 
agggctttca gtacgggaag 
agcgagagca agggagccag 
ggccccacag gaccgtctca 
cacacattcg tttctctgag 
acgggyagcL gaagttggaa 
gagcgtcgga cagcggccgt 
gcaccttcca gctctccagc 
cgcccgaatc gcccctgctg 
acgcacagtc ccgtgtcttg 
cgcccgtggc gtcgaggtcc 
gccccgaggc gtcggcctcc 
ctgctgaggg cgacctggcc 
gcctgactcc cgagcagaag 
qgalcylgga ggaggccgaa 
cggagagcag cgtggccgac 
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1320 

cggcgccgtc tcttcgagcg 
1380 

gagctgaagc agttccagca 
1440 

cggcctacct ccyccgccgg 
1500 

cagagtgcct acctccagcc 
1560 

tccagcttga gctcactgcg 
1620 

ccggccacag ttgcagaaac 
1680 

ggccgccgcc tcggggaacg 
1740 



aggggcaccc agagagggga 
1800 

gggaagtcca tgtcggccga 
1860 

catgtgaggt ccaggtcatc 
1920 

caagacagtg gctttggtct 
1980 

tcactcagtt gttcagaaag 
2040 

cctcgttggg gtggttcagg 
2100 

cctggaaccc Lggaccatca 
2160' 

gcaggaacgc aagggctggt 
2220 

cctctgcctt cagccattcc 
2280 

cctctcccgc cctacacccc 
2340 

cctcccggtc caagaggctg 
2400 

aggaagaggg tctcgctgcc 
24 60 

gccagagagg acagccttcc 
2520 

tatcaaaaac agcagagtct 
2580 

ggggccccga gcactccagg 



cgatggcaag gcctgctcca 
gagcgccctg gcggactaca 
ctgcagcctc caggagcccg 
cggccccgcg gcgctcgaag 
ggagcccagc ctgcagcccc 
ccaycaggct ccccgagatc 
gcgacgcggg gacctgctta 
tgagaccccc agggagccat 
ggacctgctg gaacgctcgg 
tcccgccacc gcagacaagc 
tgtgaaggat ccatgttatt 
aggccaagaa gagatgctgc 
ctgcaaagcc atlggtgatt 
gaggcaagcc agtaggacac 
cacagacacc agggctgcac 
ctctggctac tgctcacagg 
tgccatgatg cacagaagca 
tgagggcgat ggcccagagc 
tcagLggcca cctccttctc 
tgaggaatcc tcagcccctg 
tccaagttta tgcagcactt 
gaggatctcc ctccgaatat 



cgctcagcct gtcgyggccc 
tccagcgcaa gaccggcaag 
ggccactgcg tgagcgcgcc 
gctccggcct cgcctcggcc 
gcagggaggc cacgctcctg 
gcagcagctc cttcgccggt 
gcggagcaaa cggtggaaca 
cctcctgqgg ggccagggcc 
acgtccttgc gggccctgt.c 
gccaggatgt gcttttgggg 
tggctggtcc tggatctagq 
tgctcttcca ccatctcacc 
cctccgttcc tagtgaat-.gt 
cctgccccag gccaccactg 
ccctgacccc aattggcacc 
acggtcagac agggcgacag 
atggtcacac cctgacccag 
atggggtaga agagggaacg 
gagcaaagtg ggcccacgca 
attttgcaaa cctgaagcac 
ctgacccaga c;acacctctt 
ctgagtctgt cctgcgggac 
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2640 

tccccgccac ctcatgagga 
2700 

gccacgtcca gccccacatt 
2760 

gaaaccccgg tyLatagcat 
2820 

gcgcagctgg acagtgagga 
2880 



gtgacaattg caagggaaag 
2940 

ctggcttcca gactccaaac 
3000 

ttcatgagcg ttcacgccca 
3060 

actcaaagcc tcagccatga 
3120 

cctgatcctc agaagagttc 
3180 

caccaagaca aatctctagc 
3240 

gacctgatgg aaggtttgtt 
3300 



aggaaggcca tacagagaac 
3360 



aaggaagcag tgagcatgtt 
3420 

gctgagctac tgaacaaaat 
3480 

gcagatgtca atgaaaagaa 
3540 

ctccaggagg cgaaggggag 
3600 



gaggtggagg ctctgatcag 
3660 

ttcatagggg atttggacaa 
3720 

cgtgttgaga atgtccttag 
3780 

ctttacgaga aaaggaagat 
3840 



aacctggatc gcagggagcg 
3900 

cagctccagg actaccagca 
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ttatgaagac gaagtgtttg 
tgaacctctt cccccacccc 
ggatgacttc cctccacctc 
tcccgagggg ccacgcccca 
gcacatgcct ggtgcagccc 
Itctatcaag ggttcagagg 
acttgctggg tctcttggtg 
tccagtcagt ggaactcagg 
agaagacatc agaacagagg 
agacattttg gatccagact 
tccccgagat gtgaacttgc 
tgtcagctct tcaggatgtg 
ggttaaetgc cctgcctact 
caaagagatg ccagcagaag 
ggctgagetc attggaagtc 
cctgctcacg gacatcaagc 
cgagctctgc aagcccaatg 
ggtggtcaac ctgctgctct 
cggccttggt gaagatgcca 
cctggctggt cagcatgagg 
agtagtqctg ggcatcttgg 
cttcgtgaaa atgaagtcca 



tgagggatcc gcaccccaag 
cacctcctcc accgagtcag 
ctccccacac tgtatgtgag 
gcttcaacaa actttctaaa 
atgtggtagg tagtcagaca 
ctgagtccac accaccctcc 
ggcagccagc acccatccag 
gtttagaaaa gaaagtcagt 
ctttggccaa ggaaattgtc 
ccaggctgaa gacaacaatg 
tgaaggaaaa cactgtaaag 
aaggcaagag gaatgaagac 
acagtgtgtc tgctcccaag 
tgaatgagga agaggaacag 
tcacccacaa gctggagacc 
tcaacaacgc cctgggagaa 
agtttgacaa gtataggatg 
ccctctcggg gcgtctagcc 
gtaatgaaga aaggagctct 
atgcccggga qctgaaggag 
ccaattacct ttcagaggag 
cgctcctcat tgagcaacgg 
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3960 

aagctggatg acaagatcaa gctgggccag 
4020 

ccctcagatt tcattcccaa ggctggggcc 
4080 

cccattcctg ctgggggctg tactttcagt 
4140 

taacctcttc taaaataccc aaccaaaaga 
4200 

aaaaatgttt cagtacaaac cactgtttga 
4260 

tgaaaggaaa aaaattctct ccaggaggaa 
4 320 

tcttctgaga gctcgaatgc tgctggacac 
4380 

aaagaaagtt aatgaaactg agaactgatt 
4440 

ggctgaggca acatggatca gtgtgtgtcc 
4500 

ttgctggLgg gcagtgtatc ctgatggcag 
4560 

acataaagcc ttaaaaagac acacactaag 
4620 

atatttttta aaatggaaaa gatcactatg 
4680 

ttttgtggtg yacatagatg attacgtttg 
4740 



atgaattcca aagat 
4755 



<210> 7 

<211> 2045 

<212> DNA 

<213> human 

<400> 7 

gaaacttgac cccggctcat cctgtctctg 
60 

tctggtctca gagacagtag gggcagtgcc 
120 

gagtaaUtct gaaaaaaatt attccf.aaac 
180 

cccagaagta ttttacaacc agagtgggta 
240 

tctctgaata ttgaaaacaa caactaaaaa 
300 



gagcaggtca agtgtctgct ggagagcctg 
ctggctctgc ccccaaacct cacgagtgag 
ggtattttcc caacattaac ctctccactt 
tcactgtttc tctcaacact atttaatctg 
actatctggg ttattggtgt ttgttcctga 
gcctttttcc ttcttgccct tcctgattga 
gtaccccttt ctattattac tttgtagtag 
ggagggtgtt tgatcattta gtttf.taaca 
ccctcaggaa tgtatccaca gtggccttcc 
ggtacaagta ccattaatga agggtctgca 
aaaactgtaa aaccttgaac attgttattt 
ttLgttgtgc taaccactta tttgattctg 
agctttgtat tttgtgaaaa ccttaatgaa 



gctgtggccc ggcaaagcac tgaaaacccc 
actttctaca acctgccaac ccacacactg 
tctctaagtg tggacggaga atgagcaagc 
atgaggaggg ggcttactgg aatcgtcata 
agtiggacctt ctcagaaaaa aagggcagca 
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aatgaccaaq ggcgcccctt 
360 

cccccatcac agggctttca 
420 

atgtttcttt ctccatcggc 
480 

gaggtgcact tctgaatgct 
540 

ccaggaacca tcgccttcta 
600 

aaaggctgct attgtgcaag 
660 

tgcatggcaa agagagagat 
720 

ccataacaat tctaacactt 
780 

tatttlgcca aatgtatctt 
840 

tctacaaatg ttacttataa 
900 

tctttgacct cagggaqagc 
960 

ctaaacaacc aeatcctatc 
1020 

gagcaggagg gaggcaagcc 
1080 

gattcaactt tgttttcttt 
1140 

ctgtgtcttc atttctaaaa 
1200 

tcacagatgt gttgtgagga 
1260 

aaaatactta tagacagttt 
1320 

tgctgtgata ctaatgtgtt 
1380 

acatcagaca gctatctaag 
1440 

cqaccatgcc tggtgccaca 
1500 

ataggtctca aagactcttg 
1560 

gagcactaac agcagtagac 
1620 



ctggccgtgc ttggcttgag 
gtttggcaaa ggaaaagcag 
caaaaacatt ttgacacaat 
gcctctgccg taaatcctgg 
taaaccgtga actcaagcag 
ggcacataat gggtctgttq 
gtgggcctag agcagatata 
cttatcttat gtgagaataa 
ttctgctttt gaattgggca 
cacacacaca cacatctgaa 
acctgtccag glctgcctaa 
catggatagg tctagtcata 
gcctcttctc ggccatcaac 
tcctgtggcc ccagtqaaac 
tgggggtgat gctt tcatat. 
ttaataaaat tatgtctatg 
aactattaca tagatatata 
gttttaactr. attccataaa 
ctctggacta cccctttgtg 
gccatggttt ccatttctag 
gatcagaatc aggagattag 
gtagacctct gtcctttacc 



taactgtctc tctttcccca 
ataaaaacag aacattccat 
gtttgtgaaa cacct.tf.gga 
ggcaagggat cagcctcttc 
gcattttttt tttcttaccg 
ctcttattgg cttccaaatg 
ttcagcaagg tgacagcttc 
aatatttaag ggttgaacct 
gaagatttta gcaactatat 
atatatgccg aaaattgacg 
aggaaatggc tccagtgggt 
acactttaga gagaatgtca 
tgcagatgat gaaagagcgg 
ctcctgccct ccctgcacgt 
tgacctcacc ccatactacc 
gtattttcag tttctggaga 
agtgatctca gtttcttgtt 
atgacagttg tgtcctagcc 
cagctgaatc actgcagggt 
atgaaaggat ggcctaggac 
ggaaaacagg atggatacct 
atctgaggtc ttctggattc 
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tttgtggggt taattttgat 
1680 

gcatggttca atcccLcaca 
174 0 

gcacgttgat tcaccttctt 
1800 

aatgaatgta ccatttcaac 
1860 

aaccaagaga tcaatgccag 
1920 

tggaaaggtg ggaaaattct 
1980 

atattgcata ccaattataa 
2040 



aaaaa 
2045 



<210> 8 

<211> 2096 

<212> DNA 

<213> human 

<400> 8 

atggagaacg agcct-.gtagc 
60 

cggttcaaag tggtggattg 
120 

caggtgggcc acttgggaga 
180 



atccgcctct tccactcaga 
240 

cccatcatct gggtgcccct 
300 

cagggcaacg tccgactctt 
360 

tccatgttcc ccgggctctt 
420 

atccaccgct tcctgttcca 
480 

cacttcgtca tgcacggcca 
54 0 

ccccctgtgc cagcctccct 
600 



cccgaggcag tagggggcac 
660 

atgacccatt aclacctyca 
720 



ttgatgtcat ctgtttgccc 
tccaggaaat gaattttgca 
tgcctttaag ccttfittttt 
tttgatttta atagtgctag 
atttttctct tggggtaagt 
ttgatatttg atgtcattgt 
ttatatcaat taaagttgat 



ccttgaggaa actcagaaga 
ggacaaggac ctggtggact 
gaagtacgat gaglgggttc 
ccfccattgag ggcctctcta 
ggtgctgtat ctcagctggt 
cacgtcattt acaacagagt 
catgctgggg acattcctct 
catgaagccc cccagcgaca 
gcaccacaag gcacccttcg 
ggtgatcggc gtcttctact 
tgtgtttgcg gggggcctcc 
ctttggctcg ccgcacaagg 



ttcatcttgc ttgcaagtgt 
attgggccag atgctaattt 
cttttttttt ttttttggca 
ttgatattgg taataatgct 
tagctgaagt catttaaaga 
atccacattt gttgtaagac 
aaaagcttca aaaaaaaaaa 



cagatcctgc tatggaacca 
ggcgaaagcc tcLcctgtgg 
accagccggt gaccaggccc 
agactgtctg gtacagtgtc 
cctactaccg aacctttgcc 
acacggtggc agtgcccaag 
ggagcctcat cgagtacctc 
gctattacct catcatgctg 
acggctcccg cctggtcttc 
tgtgcatgca gctcatcctg 
tgggctacgt cctctatgac 
gctcctacct gtacagccfcg 
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aaggcccacc acgtcaagca ccactttgca catcagaagt caggatttgg tatcagcact 
780 

5 aaattgtggg attactgttt ccacaccctc actccagaga aaocccacct gaagacgcag 

840 

tgacaactcc caccccctcc gtcctgccct cagcccggcc ctggcccctt cccgaccccc 
900 

10 acccgccatt oagaccccat taagaaggtt ggcttggcca ggcaggatgg gctglgtccg 

960 

gccctgcagc ctagtggaag gtgctgaggg ggccctgayg caggaccgcc ctcctgaccc 
1020 

15 ctggtaggag ggtcacatcc acttggtgca ggtggccctt ggtgacccac ttcttcctgg 

1080 

agcgtccctg cctagagctc agcccacagg actgcttcag gccgtggcca caggtagcag 
1140 

20 ccgcaagggg aaatgaagaa aactgagccc tcgtggccac ctgtgtcacc cttgtgcctt 

1200 

agcctcatgg gctgcctagg agctgcctgc acggcacagc tcgctttcac agtcagaagt 
1260 

25 gggtctgtgg gatctgtgyt ccctgtcctc cctgctgtcc cttctgggga ggctttggtg 

1320 

gctctgaggt ggacaaagag ctctcgcaag aagagacagc gtgatgcctc ccacagtcca 
1380 

30 ccccagaccc tggggcagcc cctctggccc tgccagctgc ctgcgtcgtt gggcccaggg 

1440 

tggctggcag gagtcccagc tgcttgcttt aygacctggc agcttttctt gccgtccctc 
1500 

35 ccctgcctcc agaatcacag cccttctccc caagggaggc tgaggaggct tctccaccag 

1560 

tggcagcccc accccgtccc tggccattct tggcctccac cccgctcagg cccctacLcg 
1620 

40 ggcgctccca gaaggagcca cctctcagtg cctcacctcc ccctgcctcc cagcctccgc 

1680 

agatgaggtt cctgcccctt cctcctcgta accaaaaccc tcactgctcc caggacggtc 
1740 

45 ttatttataa accagataca tgttcttagt ctggtcccag accaaggagc tggtcagacg 

1800 

gccclttcta atcctacatg ttgagcttat gtaaaaaatg ttgttticctc ctgtttttgg 
1860 

50 ttcctttctt acccacaaac cattactact tgaaacttaa aaaactcgcc aagtgtaaag 

1920 

gctaaagaga agcagtttga cggaccttgt gatttgtact gtttgctgcg gagctattta 
1980 

55 aagattttgg aataaatata caaaactacg gttgtgaaat aaaaacttaa attgtatatt 

2040 



EP 1 355 149 A2 



ttgaaoaata aaacactgaa aagaaaccaa caaaaaaaaa aaaaaaaaaa aaaaaa 
209G 

5 

<210> 9 

<211> 5640 

<212> DNA 

<213> human 

10 <400> 9 

ggaaacgcag aaaacagaga gaggcattct gagtcatctg actggatgaa gactgttcca 
60 

agttacaacc aoacaaatag ctccatggac ttcagaaatt atatgatgag agatgagact 

120 

15 

ctggaaccac tgcccaaaaa ctgggaaatg gcctacactg acacagggat gatctacttc 
180 

attgaccaca ataccaagac aaccacctgg ttggatcctc gtctttgtaa gaaagccaaa 
240 

20 

gcccctgaag actgtgaaga tggagagctc ccttatggct gggagaaaat agaggaccct 
300 

cagtatggga catactatgt tgatcacctt aaccagaaaa cccagtttga aaatccagtg 
360 

25 

gaggaagcca aaaggaaaaa gcagttagga caggttgaaa ttgggtcttc aaaaccagat 
420~ 

atggaaaaat cacacttcac aagagatcca tcccagctta aaggtgtcct tgttcgagca 
480 

30 

tcactgaaaa aaagcacaat gggatttggt tttactatta ttggtggaga tagacctgat 
540 

gagttcctac aagtgaaaaa tgtgctgaaa gatggtcccg cagctcagga tgggaaaatt 
600 

35 

gcaccaggcg atgttattgt agacatcaat ggcaactgtg tcctcggtca cactcatgca 
660 

gatgttgtcc agatgtttca attggtacct gtcaatcagt atgtaaacct cactctatgt 
720 

40 

cgtggttatc cacttcctga tgacagtgaa gatcctgttg tggacattgt tgctgctacc 
780 

cctgtcatca atggacagtc attaaccaag ggagagactt gcatgaatcc tcaggatttt 
840 

45 

aagccaggag caatggttct ggagcagaat ggaaaatcgg gacacacttt gactggtgat 
900 

ggtctcaatg gaccatcaga tgcaagtgag cagagagtat ccatggcatc gtcagqcagc 
960 

50 

tcccagcctg aactagtgac tatccctttg attaagggcc ctaaaqggtt tgggtttgca 
1020 

attgctgaca gccctactgg acagaaggtq aaaatgatac tggatagtca gtggtgtcaa 
1080 

55 

ggccttcaga aaggagatat aattaaggaa atataccatc aaaatgtgca gaatttaaca 
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1140 

catctccaag tggtagaggt gctaaagcag tttccagtag gtgclgatgt accattgctt 
1200 

atcttaagag gaggtcctcc ttcaccaacc aaaactgcca aaatgaaaac agataaaaag 
1260 

gaaaatgcag gaagtttgga ggccataaat gagcctattc ctcagcctat gccttttcca 
1320 

ccgagcatta tcaggtcagg atccccaaaa ttggatcctt ctgaggtcta cctgaaatct 
1380 

aagactttat atgaagataa accaccaaac accaaagatt tggatgtttt tcttcgaaaa 
1440 

caagagtcag ggtttyyclt cagggtgcta ggaggagatg gacctgacca gtctatatat 
1500 

attggggcta ttattcccct gggagcagct gagaaagatg gtcggctccg cgcagctgat 
1560 

gaactaatgt gcattgatgg aattcctgtt aaagggaaat cacacaaaca agtcttggac 
1620 

ctcatgacaa ctgctgctcg aaatggccat gtgttactaa ctgtcagacg gaagatcttc 
1600 

tatggagaaa aacaacccga ggacgacagc tctcaggcct tcatttcaac acagaatgga 
1740 

tctccccgcc tgaaccgggc agaggtccca gccaggcctg caccccagga gccctatgat 
1800 

gttgtcttgc aacgaaaaga aaatgaagqa tttggctttg tcatccLcac ctccaaaaac 
1860 

aaaccacctc caggagttat tccLcalaaa attggccgag tcatagaagg aagtccggct 
1920 

gaccgctgtg gaaaactgaa agttggagat catatctctg cagtgaatgg gcagtccatt 
1980 

gttgaactgt ctcatgataa cattgttcag ctgatcaaag atgctggtgt caccyLcaca 
2040 

ctaacggtca ttgctgaaga agagcatcat ggtccaccat caggaacaaa ctcagccagg 
2100 

caaagcccag ccctgcagca caggcccatg ggacagtcac aggccaacca catacctggg 
2160 

gacagaagtg ccctagaagg tgaaattgga aaagatgtct ccacttctta cagacattct 
2220 

tggtcagacc acaagcacct tgcacagcct gacaccgcag taatttcagt tgtaggcagt 
2280 

cggcacaatc agaaccttgg ttgttatcca gtagagctgg agaqaggccc ccgyygcttt 
2340 

ggattcagcc tccgaggggg gaaggagtac aacatggggc tyltcatcct tcgtcttgct 
2400 

gaagatggtc ctgccatcaa agatggcaga attcatgttg gtgaccagat tgttgaaatc 



2460 

aatggggaac ctacacaagg 
2520 



ggaaataaag ttcttcttct 
2500 

gctccttccg gtctgfcgcte 
2640 

tcttggtctt tccttaaaaa 
2700 

tgtgatttat gatgaacagt. 
2760 

agagtctcac qtqccagtaa 
2820 

agaattaaag gacattgtgc 
2880 

aaagcatcag tctcttctcc 
2940 

gcacagtaac aagaaaaatc 
3000 



ggttagtccc aaaaagccag 
3060 

tcctctaaaa aataacccca 
3120 

ggaaaataaa agttgtcagg 
3180 



aagcagaggt cggtcggcca 
3240 

atcaaatgct gagcceaaal 
3300 

cagtaatgct gagcagatcc 
3360 



aaagcagaat cagttggaoa 
3420 

ggttgagaaa tctcttccat 
34 80 



aaatgaaaaa ggaaagaaag 
3540 

agaaaatgtc cagctatcag 
3600 

aaacaaaaca caagatcaca 
3660 

tggaaggttc aaaccggaaa 
3720 



ctggaaggtt ccaagtggaa 
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aatcacacat actcgagcaa 
tttgaqgcca ggaactggct 
ctacgtgaaa cccgagcaac 
gacttggtga ttgggatatt 
caccattacc cccatcttca 
ttgaagaatc tLlgagagtt 
ctgaaaagaa aagcacttta 
agaaaaatgt gagtaagagg 
tattaaaagt agaaaatggt 
ccagtcaaca ttcagaggaa 
aaagaagacc cagagatcaa 
tcagcaccag ggcaggctct 
gcccaaaaaa gcagcaaaaa 
tattagaggg taagagtcga 
cagatgggaa ggaaaaatca 
aaagcagaac aaggtctcca 
ccaaaatgac taataagact 
taaccacagg agaaacaagt 
aaaagaggct gaagcaagaa 
aagggaaaga actagaggca 
gcagttctcc agttaagaaa 
ataaagtcac aggcactaLL 



ttgagctcat tcaggctggt 
tgatacctga ccatggtttg 
attaaggctt tcagggcttt 
aataatcctt cgtcttcaaa 
cattttgctt ccatatttga 
cagatatgtg aaaaggcaga 
aatgaaaatc agcctgagat 
gatccaccca gcagtcatgg 
gttacacgaa gagglagatc 
catttggata agattcctag 
tccctcagcc ccagcaaagg 
ggacaagatc agtgcagaaa 
altgaaggaa gcaaagctcc 
agaatagcag gctatacggg 
gacgtcatca ggaaagatgc 
gagaaaaaaa tcaaaagaat 
acaagtaaag aagtatctga 
tctagtaacg ataaaatagg 
cctgaagaga aggtagtttc 
gctgacaaaa acaaagagac 
acactgataa ctccagggcc 
ggtatggctg agaaacggca 
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3780 

gtaaccttta gtataaaaca 
3840 

tccagaaaaa gccttttttt 
3900 

tgagcctcaa gttacetagg 
3960 

cccctggccg gctgccctcc 
4 020 

cccgccctgc tcaggctgcc 
4080 

gatattaaca atgtggtcac 
4140 

gtctgttttt atcctttttg 
4200 

tggaagttga tgatctcaat 
4260 

gcagaagaag agactgttaa 
4 320 

tgatgatgtc ccactttaaa 
4380 

atgtagtcca atattgatgc 
4440 

tttcaaataa ctgttaaaga 
4500 

aagcaataga gttcatctcc 
4560 

aggtcaccct ttttatttca 
4 620 



gtgtttattg ctctttgaag 
4 680 



ctactaaagt cattagtctt 
4740 



gattatgtga gggacttaat 
4800 



tagcaggtgc acaagcttgg 
4860 



aggacaaatg tgtagacagt 
4920 

ctttcttatt taaaaacaag 
4980 

agaattcttt ttaaaagtct 
5040 

ttccataaaa tagatccagg 
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aagaaaaaca agttgtaatc 
ttttttcaga tattctgaaa 
ctgcatgaag ggcctttagg 
ctcgctctca ggaaggagct 
cagctcgtct tcatgagtgt 
aactcacttt gtatttgtgc 
ttcagctgtt tccacagtaa 
agcatgttgc atgtttacag 
ctcatcgtta aagatggccg 
aataaaaccc ccaaacatca 
tttcttatgg ctttttattt 
tattacttac aattgaatgt 
tgctgtgtta tccaacctcg 
gttatttaat tatgaaacca 
aaaaaccacc aactatttct 
taatacataa tacatatttg 
catgaaacca gtttcacagt 
gtgtttaaaa acaacctgtg 
acttactgga tcttatttaa 
aaagggagac taaacatctg 
agttaaagat gtttct taga 
tttttcagtt ccctgaagca 



ttttcttaca gcagcatttt 
cagataagta catgttaatg 
attgctaaga accaactgtc 
gcatccacat gctcatctga 
ctgaacaaat gacatatgtt 
caagttatct actgtatcat 
tgaaaaagtt aggtttggct 
agagaaatat gtgagtcctt 
ttgtctcttc taacagctac 
ctactttaag gaaaaaaaaa 
taatttggct ggataagttg 
ttgaaataag aaagtactct 
atglatactt acagcatctc 
taaagaagca tgtggaaata 
ggatattttg gctgtaccta 
aaaagtaaac atattatata 
ccaagtacca actcttctgg 
tagggtatgc ccagcaaatg 
cttttagcta cattaactaa 
cttaacttgt acacattttc 
agttggagac tgttaacaac 
gcattcagta gcatctatat 
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5100 

aaataaaggc accttctgag aataaaacta ttttatggag tgtgtqaaca cacttgttct 
5160 

gtcacctggg ttcatcttgt tqtqaaqcac attaggtcca ggtccttccc tcLgygagtc 
5220 

tgactgtgaa actctltaae ccaacaactc aattagcccc tgtagataag acatgcttcc 
5280 

cagagtgaga tt tfctgaaat: ccccttttca tccagaacta tatttaccca cctattgtaa 
5340 

ctattcaaat agagcaaaat taggaggctt gataaatact aagaatttag taccacagaa 
54 00 

attatttatt attttccctg tagtccacaa Utagtgataa cgaatcctat ttttgttaac 
54 60 

tgtgacataa ctttgatgtc atatgttgtc ctatgtggtt cttcctaagt aaactctgta 
5520 

ctgattatat actgacttag caatgtggcc ttggaatgct gagcaaaatg tggatgtact 
5580 

ggttgtaaat gtttatatat tgtacagtac ctttatatat acacttgagg ttctgattag 
5640 



<210> 10 

<211> 457 

<212> DNA 

<213> human 

<220> 

<221> mi sc_ feature 

<222> {242} . . (242) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (369) . . (369) 

<223> any kind of base 



<220> 

<221> m.i.sc_feature 

<222> (394) . . (394) 

<223> any kind of base 



<220> 

<221> misc_f eature 

<222> (406) . . (406) 

<223> any kind of base 



<220> 

<221> misc_f eature 

<222> (457) . . (457) 

<223> any kind of base 



<400> 10 
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tcagtcactc tttcaccctg ccaaagcttc 
60 

aataaaatgt catcatatta agccactggg 
120 

tcttaattta cctaatacac acattgtgac 
180 

tacatgtcca qctttgagga tccctggcat 
240 

tncaattccc acttttaagg catctgtggc 
300 

ggaagggatt tatctggggc accataggaa 
360 

agggagtgng gaggaggggc cttcaggccc 
420 

ggtgqtttgg ggggttttcc ctcttacccg 
457 



<210> 11 

<211> 1493 

<212> DNA 

<213> human 

<400> 11 

aatacggttg gcggctgcag cgggcggcaa 
60 

tggaggctgc acagctcgac gggccacact 
120 

atgacctgga cagctgcaag cattccagct 
180 

tgtgggactg gactgtggcc ccacctgtcc 
240 

cagcagccgc ttttagccac ccccaggctg 
300 

gccctgcagg gaacctcgaa ctggccccca 
360 

cataccccac ggagaacttc gctagccaga 
420 

ccagccctgt gctatcagag gaggaagact 
480 

cggacagcga gtcggatgag gccctcgtgg 
540 

ggactcgcaa gaagctgcgc ctgtaccagt 
600 

tgcgtgagtg cgtgtggtgg gtggagccag 
660 

acaaagaact cctggcgcgc cgctggggcc 
720 



actgtcctac tgattgaatt gtatgtgaga 
atttgtatgt ttatctgtta tagcagcaag 
agatgttctt aatgtcccac cccatattgt 
gtgggggtag gagtttctgg gcatgctgga 
ctctgtggcc tctgtggcct tcactgttat 
actttaccat ggcacagtgg acaacctagg 
aacngggggg accagngttc gtggggttag 
tgggggn 

acagcccgcc cggcaccacc atgctcgccc 
tcagctgtct gtacccagat ggcgtcttct 
accctgattc agagggggct cctgactccc 
cagccacccc ctatgaagcc ttcgacccgg 
cccagctctg ctacgaaccc cccacctaca 
gcctggaggc cccggggcct ggcctccccg 
ccctggttcc cccggcatat gccccgtacc 
taccgttgga cagccctgcc ctggaggtct 
ctggccccga ggggaaggga tccgaggcag 
tcctgctggg gctactgacg cgcggggaca 
gcgccggcgt cttccagttc tcctccaagc 
agcagaaggg gaaccgcaag cgcatgacct 
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accagaagct ggcgcgcgcc ctccgaaact acgccaagac cggcgagatc cgcaaggtca 
780 

agcgcaagct cacctaccag ttcgacagcg cgctgctgcc tgcagtccgc cgggcctgag 
840 

cacacccgag gctcccacct gcggagccgc tgggggacct cacgtcccag ccaggatccc 
900 

cctggaagaa aaagggcgtc cccacactct aggtgatagg acttacgcat ccccaccttt 
960 

tgggataagg ggagtgctgc cctgccataa tccccaagcc cagcucgggc ctgtctggga 
1020^ 

ttccccactt gtgcctgggg tccctctggg atttctttgt catgtacaga ctccctggga 
1080 

tcctcacgtt ttgggtgaca ggacctatgg accactatac tcggggaggc agggtagcag 
1140 

tgcttccaga gtcccaagag cttctctggg attttcttgt gatatctgat tccccagtga 
1200 

ggcctgggac ctttttaaga tcqctgtgtg tctgtaaacc ctgaatctca tctggggtgg 
1260 

gggccctgct ggcaacccLg agccctgtcc aaggttccct cttgtcagat ctgagatttc 
1320 

ctagttatgt ctggggccct. ctgggagctg ttatcatctc agatctcttc gcccatctat 
1380 

ggctgtgttg tcacatctgt cccctcattt ttgagatccc ccaattctct ggaactattc 
1440 

tgctgcccct ttttatgtgt ctggagttcc ccaatcacat ctagggct cc tec 
1493 



<210> 12 

<211> 2292 

<212> DNA 

<213> human 

<400> 12 

ccatgggttc cccttcagcc tgtccataca gagtgtgcat tccctggcag gggctcctgc 
60 

tcacagcctc gcttttaacc ttctggaacc tgccaaacag tgcccagacc aatattgatg 
120 

tcgtgccgtt caatgtcgea gaagggaagg aggtccttct agtagtccat aatgagtccc 
180 

agaatcttta tggctacaac tggtacaaag gggaaagggt gcatgccaac: tatcgaatta 
240 

taggatatgt aaaaaatata agtcaagaaa atgccccagg gcccgcacac aacggtcgag 
300 

agacaatata ccccaatgga accctgctga tecagaaegt tacccacaat gacqcaggat 
360 

tctataccct acaegttata aaagaaaatc ttqtqaatga agaagtaacc agacaattct 
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420 

acgtattctc ggagccaccc aagccctcca tcaccagcaa caacttcaat ccggtggaga 
480 

acaaagatat tgtggtttta acctgtcaac ctgagactca gaacacaacc tacctgtggt 
54 0 

gggtaaacaa tcagagcctc ctggtcagtc ccaggctgct gctctccact gacaacagga 
600 

ccctcgttct actcagcgcc acaaagaatg acataggacc ctatgaatgt. gaaatacaga 
660 

acccagtggg tgccagccgc agtgacccag tcaccctgaa tgtccgctat gagtcagtac 
720 

aagcaagttc acctgacctc tcagctggga ccgctgtcag catcatgatt ggagtactgg 
780 

ctgggatggc tctgatatag cagccttggt gtagtttctg catttcggga agagtgtttt 
840 

tattatccac ctgcagacLg gactggattc ttctagctcc ttcaatccca ttttctcctg 
900 

tggcatcact aagtataaga cctgctctct tcctgaagac ctataagctg gaggtggaca 
960 

actcaatgta aatttcaagg aaaaaccctc atgcctgaqa tgtgggccac tcagagctaa 
1020 

ccaaaatgtt caacaccata actagagaca ctcaaattgc caaccaggac aagaagttga 
1080 

tgacttcatg ctgtggacag tttttcccaa gatgtcccaa gcctcatcgt gacgaggctc 
1140 

ttatcccact ccatttttcc ctgctcatgc ctgcctcttt aatttggtaa gataatgctg 
1200 

taactagaat ttcacaatca gcgccttgtg caggcaattt gacagagtgt tggatgtgtc 
1260 

atgtcatcal gLcaaaccca aatatttgac ctaagggatc ctttattctg cccagtggch 
1320 

aactttaaca acatccr.taa tacaactgtt tattcaaatg cacggtggtc cctgttagag 
1380 

ttagacctct agactcacct gttctcacgc cctgttttaa tttaacccag ctatgggatg 
1440 

ccagataaca gaattgctgc ctacgagctg aacagggagg agtttgtgca gttgctgaca 
1500 

cttcttgttg cacataaata aatacagtgg gtactataga gactcagttg caaaaattaa 
1560 

caaatalgct gcttgattaa aatgggtagg cttctcatgt ggctcattct ttaatctatt 
1620 

ctcttttatt tggtttggtc catgg'ggtct ctgcctatgg atcatacttc aaactcttgg 
1680 

tgtgatcctc ctgattgtca caatattagt taccctggtg tgctgtattc tct:aaaacct 
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1740 

ttaaatgttt gcatgcagcc attcgtcaaa tgtcaaatat Lctctctttg gctggaatga 
1800 

caaaaactca aataaatgta tgattaggag gacatcataa cctatgaatg atggaagtcc 
1860 

aaaatgatgg taactgacag tagtgttaat gccttatgtt tagtcaaact ctcatttagg 
1920 

tgacagcctg gtgactccag aatggagcca gtcatgctaa atgccatata ctcacactga 
1980 

aacatgagga agcaggtaga tcccagaaca gacaaaactt tcctaaaaac atgagagtcc 
2040 

aggctgtctg agtoagcaca gtaagaaagt cctttctgct ttaactctta gaaaaaagta 
2100 

atatgaagta ttctgaaatt aaccaatcag tttatttaaa tcaatttatt tatattcttc 
2160 

tgttcctgga ttcccatttt acaaaaccca ctgttctact gttgtattgc ccagtaggag 
2220 

ctatcactat atLLtgcaga atggaaactg ccctgactct tgaatcacaa at.aaaagcca 
2280 

attgtatctg tt 
2292 



<210> 13 

<211> 519 

<212> DNA 

<213> human 

<220> 

<221> misc_feature 

<222> (212) . . (212) 

<223> any kind of base 



<220> 

<221> misc_t'eature 

<222> (451) (451) 

<223> any kind of base 



<400> 13 

qaaacaacaa cagtgtaatc tttaacaggg atgttaaagg taagaagtca ggaagataaa 
60 

ccaaaatgat tgagtatgat aaagaatttt gcatggcgat taaaatagaa aacctataaa 
120 

tgtagaaaaa gcaggtctgg acttagcaaa gaaacaatat agtttggaga aggcatgaaa 
180 

taagttcttt tcatgttcac tgctggtcac ancataacag agagtgatgt ggagagcttt 
240 

gggaaggttl cacgttgagt tacatcagtg gtcaacaatg gagcaacaag actccgtaca 
300 
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ggatgccacc ctgggagaat tgcaagggaa aggaggctga agcacaactg gtaatagcct 
360 

tcagatattt aatggatatg caaataaagc tctgattaat tgtattttca cttattatat 
420 

atcatctttg gacctttcta aaagtgggac nctagaaaag atatactgaa actccaaaag 
480 

aatacttcag ctcgagttga atggattcaa gatgttgtt 
519 



<210> 14 

<211> 5294 

<212> DNA 

<213> human 

<400> 14 

ggctcgcatc cccatagtgc tgggttacag tgaaggtacg ccccgcgctc tgctctggag 
60 

aggcagggtg ggatagggaa cgtctcgagt ggcgcccgca gtcatggtgg tgttcgttgg 
120 

ccgccgcctc ccggcgctcc tagggctgtt taagaagaag ggctctgcna aggctgagaa 
180 

tgacaaacat ctaagtgtag ggcctggcca ggggccaggg tctgcagtgg atgagcacca 
240 

ggacaacgtc ttctttccca gtgggcgacc cccccacctg gaagagctgc acactcagqc 
300 

ccaggagggg ctccgctccc tacaacacca agagaaacag aaactgaaca agggtggctg 
360 

ggaccatgga gacacccaga gf.atccagtc ctcccggacg gggccggatg aagacaacat 
420 

ctccttctgc agtcagacca catcctacgt ggctgagagc tccacagcag aggacgcgct 
480 

ctccatccgc tcggagatga tccagcgcaa aggctccacc ttccgacccc atgactcatt 
540 

tcccaaatct ggaaagtcag ggcggcgtcy gcgggagcgg cggagcactg tgctgggact 
600 

cccgcagcat gtgcagaagg agcttggcct gaggaatgag cgtgaggcac caggcacgcc 
660 

ccgggctcct ggtgcacggg atgccgtacg catccccaca gtggacggcc gcccccgagg 
720 

cacctcaggg atgggggccc gggtgtccct gcaggcgctg gaggcggagg cagaggctgg 
780 

cgctgagaca gaggccatgc tgcagcgcca cattgaccgt gtctaccggg atgacacctt 
840 

tgttggccgg tccacgggta cccgggcccc accattgacc cggcccatgt ccctagcagt 
900 

gcctggattg acaggagggg cagggcctgc agagcccctg agcccggcca tgtccatctc 
9G0 
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cccccaggcc acctacctgt cgaagttgat 
1020 

cgtggtggcc ctaggccgct gcagcctgcg 
1080 

caycccagcc tcagtccgct cgctggggcg 
1140 



cagccgccac ccatcctcct ccagtgacac 
1200 

tgtgtctgac ggttccaccc tctcctctaa 
1260 



tacggctagc aatagcgtgg taccccctcc 
1320 



tgggggcagc actgctgagg cctcagacac 
1380 



tggccggagt gtgtccctgc gtaagctgaa 
1440 



ctccctccat cagcggggct tagcagtgcc 
1500 



tgagcgtaag cagcagcccc agctgcctcg 
1560 



gggggcagca ccctgtccac ccaacccagc 
1620 



tggttcccgg cgccccccac ggtccccaga 
1680 



gagccaaagt ggtactccca ccctccctcc 
1740 

aggcaaggcc cagcccccta aaccagagcg 
1800 

cgtctcctct tccctcacgt chttatgttc 
1860 



ctctgggcca cagatattga cccccctggg 
1920 



ggtgcctgcc cccttctccc cacctccctc 
1980 



tgctctagcc gcccctgctg tggttcctgg 
2040 

gtcccctccc actccccaga caaccttgac 
2100 



agaccagtca cccccacctt ccccaccccc 
2160 



gaagccagag gtggttgtgg aggcaccatc 
2220 



agatcccaac tggccccctc ccccaccccc 
2280 



tccacatgct gtgctgccgc ctaragtgga 
cacactaagc cgctgcagcc tgcactcggc 
cttctcctcc gtctccagcc cacagccccg 
ctggagccac tctcaatcct ccgacaccat 
gggtggctct gagggccagc cggagagctc 
ccagggaggc agtgggaggg gctctcccag 
actcagcatt cggagcagtg ggcagttgtc 
gcggcctcca ccccctcccc gccggaccca 
tgatgggcca ttagggttgc cccctaagcc 
gccacccacc actggtggct cagaaggggc 
caacagctgg gtacctggct tgtctccggg 
acggacactt tcgccctcca gtggatactc 
caagggcctg gcaggtcccc ctgcttcccc 
tgtcacgtct cttcgctccc ctggggcctc 
ctcctcctct gacccagccc cctcagaccg 
tgacaggttt gtcatacctc ctcaccccaa 
caagcccagg agccctaacc cagctgcccc 
gcctgtttct accactgacg ccagtcctca 
tccactgcag gagtctcctg tcatctccaa 
atcttatcat ccacccccac cacccaccaa 
tgcctcagag actgctgagg agcccctcca 
tgcccctgag gagcaggacc tgtccatggc 
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tgacttcccc ccaccagagg aggctt tttt. ctctgLggcc agccctgagc ctgcaggccc 
2340 

ttcaggctcc ccagagcfctg tcagctcccc ggctgcttcg tcctcctcag ctactgcttt 
2400 

gcagattcag cccccgggta gcccagaccc tcctccagct ccgccagccc cagctcctgc 
24 60 

tagttccgcc ccagggcatg tggccaagct ccctcagaag gaaccggtgg gctgtagcaa 
2520 

gggtggtggg cctcccaggg aggacgtagg tgcgcccctg gtcacgccct: cgctcctgca 
2580 

gatggtgcgg ctgcgctccg tgggtgctcc aggaggggct cccaccccag cactggggcc 
2640 

atcggccccc cagaaaccac tgcgaagggc cctgtcaggg cggqccagcc cagtgcctgc 
2700 

cccctcctca gggctccatg ctgcggtccg actcaaggcc Lgcagcctgy ccgccagtga 
2760 

aggcctctca agtgctcagc ccaacggacc gcctgaggca gagccacggc ctcr.ccagtc 
2820 

ccctgcctca acggccagtt tcatcttctc caagggctct aggaagctgc agctggagcg 
2880 

gcccgtgtcc cctgagaccc aggctgacct ccagcggaat ctggtggcag aactccggag 
2940 

calctcagag cagcggccac cccaggcccc aaagaagtca cctaaggctc ccccacctgt 
3000 

ggcccgcaag ccgtctgtgg gagtcccccc acccgcctcc cccagttacc ctcgagctga 
3060 

gccccttact gctcctccca ccaatgggct ccctcacacc caggacagga ctaagaggga 
3120 

gctggcggag aatggaggtg tcctgcagct ggtgggccca gaggagaaga tgggcctccc 
3180 

gggctcagac tcacagaaag agctggcctg accaccaggc acctcactgg cactgctgac 
3240 

ccatcccaga aacacaatct cagggacccg agcagctcca aggacgagag gatacagcag 
3300 

acacaaccta atagagaggg cgcctgcagc cttaacctcc acggccttcg atacttatgc 
3360 

aagcctggtg ttgctcctgt cctcagagtc atcctgcgct catgcctttt cccgaatqgg 
3420 

ttcacctctg gcagttgccg cttcagtctt ggccttagcc tcatcttgaa gtgygtagel 
34 80 

ggcgggagag ggtggctgog ccccctgctg gccctgaggc tgcagagttg ggagcaggac 
3540 

acctcacctg agtttcattt tttttcatgt ccaaaccatg cacatactat agtccagaat , 
3600 
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caaagcactt ttgaaaagtg gctgcatggc catcct.ccag ggcccaggaa gttgcattcc 
3660 

aagggcctgt ttacatggca gcagaatcca tccccggcag tcagcccata gcttgggacc 
3720 

agtctgtgcc ctcctgccca gtccagttta ctcctcttgg ttcctgaagg tggccaagtc 
3780 

attgtgttcc cacaggcLtc tctaggctgg gggcaggtgt ggggctgtgg aattccaaag 
3840 

cacaaaaggt gcagagggga ttggccttcc tgtgcctcaa ctcaccaacc accctcctgc 
3goo 

cttccagttc tgccaggtgc tccatgctgg ggacaagtag gagactgcca gggcccaaag 
3960 

aaaugggtga gcagtagagt catctcgggg cacttggcag tgtcaagcac ctgccccttg 
4020 

cctccttgac cacactgggg tgggtgggcc cccagcactt cagaggcagg agcctttggg 
4080 

ctgagcaagc actgaggagg tggatggaag ggagcatctg gaggggggga gcttccttga 
4140 

gcagtgggcc caggcctggc cctccacact tcattctctg acctttctct ctcctcattt 
4200 

cggtgcatgt cctttctgca gctgcctttc agcacaggtg gttccactgg gggcagctaa 
4260 

cgctgagtga caaggatggg aagccacagg tgcattttac tcaagtcttc tctagtcaot 
4320 

gaggggcacc cagtgcttct agggcaggct gggtggtggt cccctaggta tcagcctctc 
4380 

ttactgtact ctccgggaat gttaaccttt ctattttcag cctgtgccac ctgtctaggc 
4440 

aagctggctt ccccattggc ccctgtgggt ccacagcagc gtggctgccc cccagggcca 
4500 

ccgcttcttt cttgatcctc tttccttaac agtgacttgg gcttgagtct ggcaaggaac 
4560 

cttgctttta gcttcaccac caaggagaga ggttgacatg acctccccgc cccctcacca 
4620 

aggctgqgaa cagaggggat gtggtgagag ccaggttcct ctggccctct ccagggtgtt 
4680 

ttccactagt cactactgtc ttctccttgt agctaatcaa tcaatat tct tcccttgcct 
4740 

gtgggcagtg gagagtgctg ctgggtgtac gctgcacctg cccactgagt tggggaaaga 
4800 

ggataatcag tgagcactgt tctgctcaga gctcctgatc taccccaccc cctaggatcc 
4860 

aggactgcgt caaagctgca tgaaaccagg ccctggcagc aacctgggaa tggctggagg 
4920 
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tgggagagaa cctgacttct 
4980 



tgttaggatc ttctgagctt 
5040 



gggtctagaa qaggcagccc 
5100 



atggagagac caaaagcctc 
5160 



catatatata tttctttaaa 
5220 



ccctgaagcc tgagtgagac 
5280 



atgattgaaa cttg 
5294 

20 

<210> 15 

<211> 988 

<212> DNA 

<213> human 

25 <400> 15 

gtcgtgaggc gggccttcgg 
60 



cattggctct gggaagcggc 
120 

30 

ccatggcggg cgcgttggtg 
180 

actacctcat gagtacgcac 
240 

35 

ccatcaatga tatgaaaaag 
300 



gttgctattc tttgacattc 
360 

40 

tgtttgcatg ccacgcaaca 
420 

aacacgagat gactaaaacg 
4 80 

45 

aagggacagc attgccagct 
540 

ggaaaataca ctgaatgcta 
600 

50 

Lttctaaacc aactctctgc 
660 



agtagctcaa aatatgcaat 
720 

55 

tgtatatcot ctattagaat 



ctttccctct ccctcctcca acattactgg aactctatcc 
gtttccctgc tgggtgggac agaggacaaa ggagaaggga 
ttctttgtcc tctggggtaa atgagcttga cctagagtaa 
tgatttttaa tttccataaa atgttagaag tatatatata 
tttttgagtc tttgata tgt ctaaaaatcc attccctctg 
acatgaagaa aactgtgttt catttaaaga tgttaattaa 



gctggctcgc cgtcggctgc cggggggttg gcctgggtgt 

agcagaggca gggaccactc ggggtctggt gtcggcacag 

cggaaagcgg cggactatgt ccgaagcaag gatttccggg 

ttctggggcc cagtagccaa ctggggtctt cecal tgctg 

tctccagaga ttatcagtgg geggatgaca tttgccctct 

atgagatttg cctacaaggt acagcctcgg aactggcttc 

aatgaagtag cccagctcat ccagggaggg eggcttatea 

geatctgeat aacaatggga aaaggaagaa caaggicLtg 

getgetgagt cacagatttc attataaata gcctccctaa 

tttttactaa ccattctatt tttatagaaa tagctgagag 

tgccttacaa gtattaaata ttttacttct ttccatiaaag 

taatttaata atttcLgalg atgLLttatc tgcagLaatia 

ttacttaatg aaaaactgaa gagaacaaaa tt.tgt a acca 
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w 



780 

ctagcactta agtactcctg attcttaaca ttgtctttaa tgaccacaag acaaccaaca 
840 

gctggccacg tacttaaaat tttgtcccca ctgtttaaaa atgttacctg tgtatttcca 
900 

tgcagtgtat atattgagat gctgtaactt aatggcaata aatgatttaa a tat t tgtta 
960 

aaaaaaaaaa aaaaaaaaaa aaaaaaaa 
988 

<210> 16 
15 <?.ll> 4 908 

<212> DNA 
<213> human 

<400> 16 

ggataacctc gcagggtggg ccggagggcg ggcgccgccg ctgcctgtgc Igcggcgatg 
20 60 

gcccagtgtg tacaatcagt gcaggagcta atcccggact ccttcgtccc ctgtgtcgct 
120 

gcgctgtgca gcgacgaagc cgagcggctc actcgtctca atcacctcag cttcgcggag 
25 180 

ctgcttaagc ccttctcccg cctcacttcc gaggttcaca tgagagatcc taataatcaa 
240 

cttcacgtaa ttaaaaattt gaagatagca gtaagcaaca ttgtcaccca gccacctcag 
30 300 

cctggagcca tccggaagct tttgaatgat gttgtttctg gcagtcagcc tgcagaagga 
360 

ttagtagcta atgtgattac agcaggagat tatgacctta acatcagtgc cactactcca 
35 420 

tggtttgagt cttacagaga aacctttctt cagtcgatgc cagcatcgga tcatgaattt 
480 

ctgaaccact atttagcatg tatgttggta gcgtcatcta gtgaagctga acctgtggaa 
40 540 

cagttttcaa agttgtcaca agaacagcat cgaattcagc acaacagtga ttattcctac 
600 

cccaagtggt ttataccaaa tacacttaaa tactatgtac ttttacatga tgtaagtgca 
45 660 

ggagatgaac agagagctga atcaatttat gaagaaatga aacagaaata tggaactcag 
720 

ggttgctatt tacttaaaat taattctega acatctaatc gagcatcaga tgaacayata 
50 780 

ccagatcctt ggagtcagta tctccagaaa aatagtattc aaaancagga atcatatgaa 
840 

gatggccctt gtactataac ttcaaataag aattctgata ataacttgct ttoattqgat 
« 900 
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ggattagata acgaagtcaa 
960 

ttggagcaat ccagtgaccc 
1020 

tcatcgttac atgaaacaaa 
1080 

cttactgatc atgatagaat 
1140 

ccacatatag agaaaacaat 
1200 

agtcgatctc tattttctgc 
1260 

agcattaatg acctgaaaaa 
1320 

, caaatcagga aaatggctga 
1380 

tgctatcata ctgcaaagaa 
1440 

gccttggaaa tggcagcagt 
1500 

gctcattaca tggatacagc 
1560 

gctgaaagat gtgtgttgct 
1620 

gctgcagctc tcctaatacg 
1680 

ttggaacagg cagcacattg 
1740 

tttcatatga tattggcagg 
1800 

cgctgttatt gtcaagccat 
1860 

cacattaatt tcactattgg 
1920 

tctgctttta ggcatattct 
1980 

ttcctcagag aatatcttta 
2040 

ttgccacagc ttcctttacc 
2100 



gacagacgac cagcggatgg 
2160 

tatgaltctg aatcctctca 
2220 



agatggctta ccaaataact 
ttctaacagt attgatggcc 
gaaaggaaat actggaataa 
tcgacagttt atacaagagt 
taggcaatta aacgatcagc 
aactaaaaaa tggtttagtg 
tacatctggc ttgctgtatc 
cttatgtttt ttggtgcagc 
agattttctt aatgatcaag 
gtctgctttt cttcaaccag 
aattcagaca tacagagata 
tagtgctgaa cttttaaaaa 
gttgaccagt gaggattctg 
ctttataaac atgaaaagtc 
ccatcgattt agtaaagcag 
gcaagtttac aaaggaaaag 
gcgccagtcc tatactctta 
aattaatgaa agtaaacaat 
tgtttacaag aatgtaagtc 
gfcatattaac agttcagcaa 
tgaaaaacaa gcagctactc 
gcagtggcga gaacttgaqg 



ttagagctca cccacttcag 
cagatcatct aagatctgct 
Itcatggtgc atgtttaaca 
tcacatttcg gggccttttg 
taatatcaag aaaaggtttg 
gcagtaaagt tccagaaaag 
ctccggaagc accagaactt 
attatgattt ggcttacagt 
caatgcttta tgcagctggt 
gagcacctag gccatatcct 
tctgcaagaa tatggtgttg 
gccaaagcaa atattcagag 
atcttcgaag tgcacttctt 
ccatggttag aaaatatgca 
ggcagaaaaa gcatgcttta 
gctggtctct tgcagaggat 
gacagctgga taatgctgtg 
ctgctgctca acagggggct 
agctgtcacc agatggtcct 
cacgggtttt ttttggccat 
atgtaagtct tgatcaagaa 
aacaagttgt ttctgtggtt 
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aacaaaggag taattccatc caattttcat 
2280 

gataattcaa gatttccact tgcagttgta 
2340 

agaaaccctt tgaaagttct acttttgttg 
2400 

cctaaagatt tcagtggaaa ggataatgaa 
2460 

gaaatgattg gagctgaagt tatttcagag 
2520 

gcaagactaa agctctttcc ccatcacata 
2580 

aatcttggca ctattcaggg ctctatgaca 
2640 

cacacaggaa aatattcctt gagtatgtca 
2700 

ggtcctcgac ttaacaacac aaaagaagag 
2760 

cgtttagatc ccataatcac agaagaaatg 
2820 

cctacagggc ttctctgtgg agaaatccga 
2880 

aaatgtccac ttactggatt gaaggttgtt 
2940 

ggtaatactg ctgttctaac accactaagt 
3000 

aagactgttg tgacagatgc tacctctgtg 
3060 

gtagactttg gcattggcac aggaagtcaa 
3120 



actgttcttc tacccggagc ctcagtgcag 
3180 

gaaggtgtcc atgaaattaa ctttttgttt 
3240 

atacggcaca gaatattaag acaeactgca 
3300 



cgggccactg tctgcagaag taattctctt 
3360 

ctagtctttg tggatgtgga aaataccaat 
3420 

atagLgcaag tatcaagtag tagcaaacac 
3480 

gaaaacaaag atgccaaact tgccagtagg 
3540 



cccacacaat actgtttgaa cagttactca 
gaagaaccaa ttacagtgga agtggctttt 
actgatttgt cattgctttg gaagtttcat 
gaagttaaac aactagttac aagtgaacct 
ttcttaatta atggcgaaga atcaaaagtg 
ggggagctgc atattctggg agttgtttat 
gtagatggca ttggtgctct tcccggatgt 
gtccgaggga agcaggattt agaaattcaa 
aaaacatctg ttaaatatgg ccclgatcga 
ccactgttgg aggtgttctt tatacatttt 
aaagcatatg tagaatttgt caatgtcagc 
tctaaacgtc cagagttctt tactttcggt 
ccctcagctt ctgagaattg tagtgcttac 
tgtacagcac tcatatcatc agcttcttct 
ccagaggtga ttcctgttcc ccttcctgac 
ctgccaatgt ggttacgtgg gcctgatgaa 
tactatgaaa gtgtcaaaaa gcagccaaaa 
attatttgta ccagtcggtc tttaaatgta 
gaaaatgaag aaggcagagg aggcaatatg 
actagtgaag caggcgttaa ggaattccac 
tggaagttac agaaatctgt aaatctttct 
gagaagggaa agttttgctt taaggcaata 
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agatgtgaga aagaagaagc ggccacacag tcctctgaaa aatatacctt tgcagatatc 
3600 

atctttggaa atgaacagat aataagttca gcaagcccat gtgcagactt cttttatcga 
3660 

agtttatctt ctigaattgaa aaaaccacaa gctcacttgc ctgtgcatac agaaaaacag 
3720 

tcaacagagg atgctgtgag attgattcaa aaatgcagtg aggtagattt gaatattgtc 
3780 

atattatgga aggcatacgt tgtggaagac agtaaacagc ttattttgga aggtcaacat 
3840 

catgttaltc ttcgcactat aggaaaagaa gccttttcat atcctcagaa acaggagcca 
3900 

ccagaaatgg aactattgaa atttttcagg ccagaaaaca ttacagtttc ctcaaggcca 
3960 

tcagtagagc agctttctag tctcattaaa acgagtcttc actacccaga atcatttaat 
4020 

catccatttc atcaaaaaag cctttgttta gtaccagtca ctcttttact ttccaattgt 
4080 

tctaaggctg atgtagatgt catagttgat cttcggcata aaacaacaag tccagaagca 
4140 

ctggaaatcc atggatcatt cacatggctt ggacaaacac agtataaact tcaacttaaa 
4200 

agccaggaga ttcacagtct gcagctgaaa gcatgctttg ttcatacagg tgtttataac 
4260 

cttggaactc ctagggtatt tgccaagtta tcggaccaag ttacagtgtt tgaaacaagt 
4320 

cagcagaatt ccatgcctgc cctgatcatc atcagtaatg tgtgacaact tggaaatttg 
4380 

tactgaaatc cacaataatc agtttttgct ggatgggttt tacagcagta tttgatatac 
4440 

ctaacttgtt atggaggttg altgatatct gatccctgca aaatactttg acttgtcatt 
4500 

ttgttgatga tgcaaagcac gttggactga gaatacttaa cattcttttt ctgtatttct 
4560 

ttaaaccctg agaataattt acatgctcat aatacaggat atcagcatat ttgtgcacct 
4620 

tattaagccc catcttaaga aaacacaaag tctaagtctg ctgttacaac ttgtcaatgg 
4680 

tatacgaata ttaggagatg attctgagaa aggaaaggcc ttgttggcag tactcctgtt 
4740 

aagccattag tctctaaatt ccagctttac tgtgaagttc tatagagtgt taaatacaaa 
4800 

ttttcctgtc ttgcttcaca cagttcctta aaatcagttt tgaactttgg tcatagagtc 
4860 
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tlcalatttc agtatttggt ggtccctatg acttatacat aactttgt 
4908 



<210> 17 

<211> 435 

<212> DMA 

<213> human 



10 



<220> 

<221> misc_feature 

<222> (30).. (30) 

<223> any kind of base 



15 



<220> 

<221> misc_feature 

<222> (49).. (49) 

<223> any kind of base 



20 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(75).. (75) 
any kind of base 



25 



<220> 
<221> 
<222> 
<223> 



misc_feature 

(76).. (76) 

any kind of base 



30 



<220> 
<221> 
<222> 
<223> 



misc_feaLure 

(78) (78) 

any kind of base 



35 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(79).. (79) 
any kind of base 



40 



<220> 
<221> 
<222> 
<223> 



miscf eature 
(109) . . (109) 
any kind of base 



45 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(136).. (136) 
any kind of base 



50 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(137) (137) 
any kind of base 



<220> 

<221> misc_feature 
<222> (149) . . (149) 
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<223> any kind of base 



<220> 

<221> mi sc_ feature 

<222> (227) (227) 

<223> any kind of base 



<220> 

<221> inisc_feature 

<222> (236) . . (236) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (246) .. (246) 

<223> any kind of base 



<220> 

<221> misc__feature 

<222> (342) . . (342) 

<223> any kind of base 



<220> 

<221> raisc_feature 

<222> (363) . - (363) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (389) (389) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (426) . . (426) 

<223> any kind of base 



<400> 17 

ggtagaaatg attgtgatgt acaaattttn 
60 

gactcacatt cattnncnna atagtatcac 
120 

gattgctctg aaaatnnctg aagagttgnc 
180 

cccaaagctc tcctgccctt tgcagaaaga 
240 

gcatgnggtt gaacttcgcc aaaatccttc 

300 . 

ttgggcttaa agggcattga gtaagcaagt 
360 

ttntagataa tttccaaata ttacaattng 
420 



tattttgatc ahacttaana agacagagca 
tgtacacata gcgaatttnt ggcgctttta 
catagcagcc tggtaagcct tttcctttcc 
ctgttggtga caactgntgc taactnaata 
cacctcclcc catagggcaa caggggtgac 
aggttatcag anaacagagg gaagattcca 
tggaactcag agttcaactg ctcagttcct 
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tcttcngctg accct 
435 



<210> .18 

<211> 2224 

<212> DNA 

<213> human 

<400> 18 

ctttagatct gtgcagcctt tgcgtgccaa acttgtgaaa ttccttttac cttttttgga 
60 

gtacttgcta taaagccacc tgtcaacaaa cccccattat gtacagaata ggacctatcc 
120 

agtagccagg ccagtaggca gttggggaag gtgggaagga tccagcgagg cccctgagcc 
180 

tgcacctgga caggtgtacg tctgcaccca tcaccctcag caccaggcca ccctgcagtc 
240 

cacttactgt actgtgttgt ggaaggatat gctaagtgat gaaagttgcg agcagtctca 
300 

ctggtcgtgt aaactttttt ttttttttgg aaattgaagc tgtagagtgc tgcccgaaat 
360 

ctctaggaag ttggtggcaa gggacagcac tcacactctt ctggtcatga tctctgatct 
420 

ccacctcaoa tgacaataaa aaactggtcc aacgaagaca ctgctcagca cttcagccat 
480 

caggactaat ccatcgatga ctggaaaaga ggctagcttt gaggaaaaca gcctgggctc 
540 

ttgggagcag agtccagtgg gtgtgaggct gacttgccga cggtcggcag gtaatggctc 
600 

tcagccggcg aggcggtccc acagctctcc tcccagggca gcctgaggag gaggaggccg 
660 

ggtgcctgtt tggtggcagc ttcagcctag ggatacctga agctgttgag caacaccttt 
720 

atgaaatgtt gccagagcag caacacttcc ctgtgggcac agccccggga aatccggtac 
780 

caagtgagca aggtggcagg acccacccaa gcctgatacg catctgggcc cgccgggctc 
840 

agcaggggag gctgctacgg ctgcccactt cccagcaccg tctgtcaggc ttgaacccct 
900 

ctgtgctgtt cccttcctgg ctaataggga gacccttcgc aggcacccac tgtttcaact 
960 

tgaccctccc accccctgct actctcctcc acacacccct ccgttccgct agcctaccct 
1020 

glcagccttt caataaaagt tatgcacaaa tgtgaacacc tgagatggag ctgaacattt 
1080 

cttcactttg ttctttttct gaagtcaaac tcttatcaaa tgccctaaaa ttattaccac 
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1140 

ccaagagaaa caggaaaaag gttacatgtt tttgtttact gagagtaaga tcacctgcat 
1200 

ctggaagacg ggctggtaaa ttggtttggc tacagaacag aaagaaaaca aaaacaaacc 
1260 

tcgtaaggga agtatcgcac tcagacacca ccacttccta gagccaaatg agcaatccca 
1320 

aactgcaagt gccgtaagtg ggcctgtgac gtcacaccgc ccggcccgag gtatcgcatg 
1380 

tgcgggggag gcccacacta cagctgtcct ctcgtctaga aggcaccacc tcgctttcat 
1440 

gtcccgtgtg ttttggaaaa gcagtatggt gtgtcatgtc tagcggcgaa cacttccctc 
1500 

cctctgtcct tgaggttgta atataaaaac tgtgtttctg tacgtgtggg tgggaattct 
1560 

ctgacggtgc tcgttcatag cacaagctta cgctgagttc Lgaactgtcg ttcacagctg 
1620 

cgtgtctgca tggtgtcgca tctgttgtac ctttggggaa aatttgtatg taaatgtaca 
1680 

gaaataaaaa cgttgcccca ttaacagatt tcctctggaa tgtcttccct acctcacctg 
1740 

atggtatcca ccgaagggca tttcactacc attaatggtg agtaataaaa tcctccgtgt 
1800 

tcattcagac ctcactgcgt cactactttg aacgcctctg taagctgtgt cttcacccgc 
1860 

cccgaggtgg gtggagggag gcctctcact ctgcttcgag tcctggtctt aaaggtagtc 
1920 

agaggcagag gctggattaa acacacactg tttaccaagt gccactctca gaccacctga 
1980 

gagacggggg gccatcagta aaattaagag gaattttttt cccttgttcg tgtatgttct 
2040 

gctgatccgt ggcctgaagg ttcctagaga cgtcaagaaa Ugaatatctt acactgtgat 
2100 

tctgtgagga aagactggta acccaaaact ctcttctcta atgtattttt taacgaaaat 
2160 

gacaatattl ctttaataaa gtatttatac caaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
2220 

aaaa 
2224 



<210> 19 

<211> 2244 

<212> DNA 

<213> human 

<400> 19 
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gttgtttaaa agcaaggcat gcttgtggat 
60 

aagctgctcc ctggttccac tctggagagt 
120 



ttttttccct cctctttttt tgggggggag 
180 

tttttaattc attaaccagt ggttagcctt 
240 

ccacttccta gatctagtll agaaaacatg 
300 

tagtaaatgc ctcatttaat aacatactcc 
360 

tgagtagatc cagtatttga tgaaactcat 
420 

tcttttctag gacgcactat atgtgactgt 
480 

gattttttLg ggaagttaat ttctaacttc 
540 

cctttgaaat gcaccaaatg aattgagttt 
600 

tcattgtctt atatgcttag catagatttg 
660 

cagctgaaga cctgttatgt agaggaaata 
720 

tggaatgatt cacatcctct caagttagga 
780 

gggtagggtg ggggtgggga gaacacttaa 
840 



tatttctgtt ttgcatatga ggaaccctag 
900 

aaaatcatgg aaagactctt aatgcagact 
960 

cttattttgg ttgcagtttc caatttttaa 
1020 

aaacctaatt cttgtagatg cattagtgtt 
1080 

atatcatctt ttcagatgta ttaacaaaca 
U40 

acacaattcc taccaataat aaatcgatca 
1200 

ttggaccaca gtgcatgaaa acttcaacat 
1260 

aacaatccag agaaaagaaa attgcaataa 
1320 



gactctgtaa cagactaatt ggaattgttg 
aatctgggac atcttagtgt tttgttttgt 
tgtgtgtggg gtttqttttt tagtcttgtt 
aaggggagga ygacggattg attccacatt 
ttccccatct ggtgctctta ggaaggagta 
tttttgaaag ttgccttttc tctccaccct 
gaaaqtgggt ggagcccatc ttccccctcc 
gactttaagg acatttgttt gccatttgct 
tttcactgat aaatgaagaa aagtattgca 
gtaattaaaa aaattttttt tccctttcag 
cagctcagta gtatatgtgt tcctagaatg 
cgaggggtgg tgctagaaga cagacatctg 
ggatggaggc ctgcttcatt aagaagctgg 
caacatgggg accagtcagg ggaatcccct 
agcagccagg tgaggctctc tagLttaata 
cttcttaagt gttaataggg attttttcag 
aaatgttgag gtaatctttc ccaccttccc 
gaaccaatgc ttctcatgtc tcaatcttgt 
aaaccttaaa aagagtagat gaattgccaa 
actctatcta ttcaggaaag caggaagcat 
tctgttatta gataatgaat caaccaaatg 
taaaaggtaa attaacagaa agataatata 
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agcaagatag taatagttga ccattctgaa 
1380 

cctttctgaa aacaaaggat ttttaaatca 
1440 

gcaaaataat gcactatttc ttaacagcca 
1500 

aaatgaccat tgtgtcttaa tttagattat 
1560 

acccttaata gtatatacag tttctatgga 
1620 

tttgagagga cgttggtggg ctgaagatag 
1680 

tgggggttca caaggccctg acaccgtatt 
1740 

tacagctatt taaggagtag tcttaatagc 
1800 

atatgacaca gtttacattc tagtgaggga 
1860 

aatagatgct gataaatact gtagataaag 
1920 

atggagaagt catgattttt attttattta 
1980 

tctgtgcagg ctggagtgca atggtgtgat 
2040 

aagctatcct cccacctcag ctctcaagta 
2100 

cggctaattt tttgtagaga tgaggttttg 
2160 

ggactcaact gaccccacct cggcctctca 
2220 

agctggccat tgtttatgtt ctgc 
2244 



<210> 20 

<211> 351 

<212> DNA 

<213> human 

<220> 

<221> misc_feature 

<222> (62).. (62) 

<223> any kind of base 



<220> 

<221> misc feature 

<222> (121) . . (121) 

<223> any kind of base 



aagcttataa catcactcat catccagcat 
ctttatgcac atatacaaca taggaggttg 
tgtctcttgt agaacttcaa gttaatctac 
gaataccaca ttagtcaggt atttgcacta 
aaattcagtg gtccaaaaat ttccgtagaa 
ctccttgagg gtcactgatg taggctgcaa 
tatagtctaa cctttttatg aaaatctgac 
tgaaaatgaa gatagagaaa gacaccaaga 
cacaacaaaa tcaaatttaa aaaagagtgt 
cacataagaa aatagaaata aaggctgtca 
tttatttatt tattlgagac agagtcaggc 
ctcgctcact acaacctctg ctcctggctc 
gctgggatca caggtgcgtg ctaccatgcc 
ccatgttgcc caggctggtc tcgaactcct 
aagtgctgag attataggcg tgcagccggc 
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<220> 

<221> misc_feature 

<222> (207) . . (207) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (220).. (220) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (276) (276) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (300) . . (300) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (315) (315) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (336) (336) 

<223> any kind of; base 



<400> 20 

tctacttcca catcggcgag accgagaagc gctgtttcat cgaggaaatc cccgacgaga 
60 



cnatggtcat cggcaactat 
120 



nctcgacccc tggcctgggc 
180 

tgtcctggca gtacggctcg 
240 

atcaaatctg tctgcactcc 
300 

tgcgtgttgc atctngacat 



cgtacccaga tgtgggataa 
atgcacgtgg aagtgaagga 
gagggcnctt tcacgttcan 
aattcttacc aggatngctc 
ccaggtttgg gggagnatgc 



gcagaaggag gtcttcctgc 
ccccgacggc aaggtggtgc 
ctcccacacg cccggtgacc 
tctttcgctg gtgggcaaan 
caacaaatta c 



<210> 21 

<211> 2631 

<212> DNA 

<213> human 

<400> 21 

accttccaac ccagccctcg gctgagccgc gccgcaccat gcccgccgtg gacaagctcc 
60 

tgctagagga ggcgttgcag gacagccccc agactcgctc tttactgagc gtgtttgaag 
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120 

aagatgctgg caccctcaca 
180 

atggagccca gaat.gagatg 
240 

atgaaaaaca gaactttgct 
300 

atttttccaa agtggtggat 
360 

cagacacaat ggttctacct 
420 

ctttaaagga tctatttgga 
480 

gcaggctgcc taagaaaaag 
540 

ccgcggcccg gcggaagcag 
600 

tgcagtacag aaagcaaatg 
660 

ttaacttttt taagaaggga 
720 

ccgttgcaga catggttcaa 
780 

ggtgtcccag caagaattac 
840 

ggccgcacca cagatcaaca 
900 

caaaacaggg ctggtcaccg 
960 

tctcatgtgt cagcccaggg 
1020 

ctcagtgatg gccgtggatt 
1080 

tggaaaatcg ggaataatcc 
1140 

tgcaataaac aacatctcca 
1200 

caagttgaat cagaccgctc 
1260 

agaaagctca tgccccaqcc 
1320 



gattgttccc aaagcaacag 
1380 

gccgattcaa ttcgatattg 



gactatacca accagctgct 
tgcctggcca cacaacagct 
cttggcaaag gtgatgaaga 
gagcttaatc ttctccatac 
atcatacaat tccgagaaaa 
ctcgctagca atgagcatga 
gagaatgaga aggtgaagac 
cacctctcct cccttcagta 
gccatgat.gg agcccatgat 
gcagagatgt tttccaaacg 
agcattcagg tagaactgga 
tttctgttga tgaatctgtt 
ggaacctcat ccagaaggct 
ccacctggga gaggctttat 
gagccgtggc tggaggtltg 
gcgaagaccg gcgctactgc 
tccaggctga gagcagaaag 
gacagatcta cctgaccgac 
tgcaagcagt gactcctatt 
agaacctgaa aaattcagag 
ccagtctacc tgaagcagag 
tgcttcctgc tacagaattc 



ccaggcaatg cagcgcgtct 
ttctaagcaa ctgctggcat 
agtaatttca acactccact 
agagctggct aaacagttgg 
ggatctcaca gaagtaagca 
cctctcaatg gcaaaataca 
cgaagtcgga aaagaggtgg 
ctactgtgcc ctcaacgcgc 
aggctttgcc catggacaga 
tatggacagc tttttatcct 
accgaggcgg aaaagatgcg 
tacactccag actctgatgt 
ggttacctta atcttagaaa 
ttcttcaccc aaggcgggaa 
atccaggacc tggacaactg 
tttcagatca ccacgcccaa 
gaaaatgaag agtggatatg 
aaccctgagg cagtcgcgat 
acaagttttg gaaaaaaaca 
atggaaaatg aaaatgacaa 
gagctgatcg cgcctggagc 
cttgatcaga acagagggag 
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1440 

caggcgtacc aacccttttg gtgaaactga ggatgaatca tttccagaag cagaagattc 
1500 

tcttttgcag cagatgttta tagttcggtt tttgggatca atggcagtta aaacagacag 
1560 

cactactgaa gtgatttatg aagcgatgag acaagtattg gctgctcggg ctattcataa 
1620 

catcttccgc atgacagaat cccatctgat ggtcaccagc caatctttga ggttgataga 
1680 

tccacagact caagtatcaa gggccaattt tgaacttacc agtgtcacac aatttgctgc 
1740 

tcatcaagaa aacaagagac tggttggttt tgtcatccqt gttcctgaat ccactggaga 
1800 

agaatctctg agtacataca tttttgaaag caactcagaa ggcgaaaaga tatgttatgc 
1860 

tattaatttg ggaaaagaaa ttattgaggt tcagaaggat ccagaagcac tggctcaatt 
1920 

aatgctgtcc ataccactaa ccaatgatgg aaaatatgta ctgttaaacg atcaaccaga 
1980 

tgacgatgat ggaaatccaa atgaacatag aggcgcagaa tccgaagcat aactcacttg 
2040 

cgcctgtggg ggaagagcga acaggaagga gagctacctc ctaagggttt taacgtctct 
2100 

gacatacagg cacactgacc tgatttccga aggctgacaa tcgtttgtgg aatgtaatct 
2160 

tgatgccttg atactgagac ttgggaggga aactaagaaa tggttgacag cgttcccacc 
2220 

catctacaat gttattttag gtgctttgtg gtaagtcttt tttcttagat tgcgctaaaa 
2280 

tttcttagat tgttcagcgc tcagaacaaa agtttgaaaa atgcattgtt catatgaatg 
2340 

tcatctcttt tcagtttcca gtatcctttt taaaaaatgg caaaagccta gatttacaat 
2400 

ttgatgaaca ctaaatattt cttattaata taatctattt ttgtatttta cttaatgagc 
24 60 

tttaagtgcc tgtcgttctg aaaattgtgt atttataatt cagcttatct cataattgga 
2520 

cctaatagca tttctttgtg cagttaggtg atgagcactg ctttgaggcc caagcactag 
2580 

tagagatgcg cgatacaggt ctagtttcgg taactgttcc agacatcaag c 
2631 



<210> 22 
<211> 2851 
<212> DNA 
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<213> human 
<400> 22 

agcatctcag gccatcatcc tgaaacttgg cagccttcgt ggagtataag gacagcatta 
60 

ttagccatca ttgggtttac tgccaacaaa aggagaggga gccataggtt ctctagatta 
120 

cactcctgag gaaagaagag cacttgccaa aaaatcacaa gatttctgtt gtgaaggatg 
180 

tggctctgcc atgaaggatg tcctgttqcc tttaaaatct ggaagcgatt caagccaagc 
240 

tgaccaagaa gccaaagaac tggctaggca aataagcttt aaggcagaag tcaattcatc 
300 

tggaaagact atclctgagt cagacttaaa ccactctttt tcactaactg atttacaaga 
360 

tgatatacct acaacattcc agggtgctac ggccagtaca tcgtacggac tccagaattc 
420 

ctcagcagca tcctttcatc aacctaccca acctgtagct aagaatacct ccatgagccc 
480 

tcgacagcgc cgggcccagc agcagagtca gagaaggttg tctacttcac cagatgtaat 
540 

ccagggccac cagccaagag acaaccacac tgatcatggt gggtcagctg tactgattgt 
600 

catcctgact ttggcattgg cagctcttat attccgacga atatatctgg caaacgaata 
660 

catatttgac tttgagttat aatatggttt tgtgacttat gagctgtgac tcaactgctt 
720 

cattaaacat tctgcattgg gtataatcta agaattgttt acaaaaagat tattttgtat 
780 

ttacccttca ttcctttttt tgatccttgt aagtttagta taaatatatc tagacattca 
840 

gactgtgtct agcagttacg tcctgcttaa agggactaga agtcaaagtt ccttgtctca 
900 

ctatttgatc tgctttgcag ggaaataact tgttttttct catgtttcat cttcttttta 
960 

tgtaaatttg taatactttc ctataLtgcc ctttgaaatt tttggataaa agatgatgtt 
1020 

ttaagttcca atgagtatta ctagttactc aataccactt attgagtact ctgtttctac 
1080 

gtatgtagaa tgtataggga tagaagagtt gaaaagggaa agcaaaactt cttaagtggc 
1140 

ttccttaaaa tgtcattcat aggagatgta ctggaattgc tcattctgtg actttatttg 
1200 

tgtcctaaac attcttcagt gaaaataatt ttatttcagt caaacattta tgaggaaatg 
1260 



62 



agatcacatc tttgtcactg 
1320 

atgctgttat caccaccccc 
1380 

aaaacagtct tccatagatt 
1440 

ttcttccaga agttaaatgg 
1500 

tatgtctttt gggaatgtat 
1560 

ttcccaggaa tatgcataat 
1620 

gcccaagtaa ccagtgatag 
1680 

cagagcccaa gaaagaattt 
1740 

ccacagactc tagtagataa 
1800 

aaaaacattg ccttcagcat 
1860 

taatgaagat ttaattttta 
1920 

aaagtgttat ttgtaagtta 
1980 

gttctagatc ggtccacgaa 
2040 

tattcataaa gtccttatgt 
2100 

taaaatcgtc ttaaaqqcaa 
2160 

aactgtcaga caeaalltct 
2220 

aaaataagaa catattgtac 
2280 

tcgcatttta aaggtgttta 
2340 

ctaaaaagct caaggacttt 
2400 

tttataactc tattgccata 
2460 

aaccttgatt cagtcjctcag 
2520 

agaagaaaga aatccccacc 
2580 
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gatgctactt gaagagggag 
tgccctctgc tgccataatc 
tttaaggaag aaagggccca 
ggggatctga agatUtgaat 
tatatgccta gctttataat 
attgaatatt tcatgtccta 
aagttagaaa aaccccttta 
tcagtygaaa aatcaatata 
tattatcatc ataatggctg 
gttcagttcg cagcactgag 
aatacaggtg gttccaagct 
atttttttac aagtcaaaca 
agttagccca tatgtatatc 
ggttttaact aagtgaaatt 
atttaatttt tacccctgtt 
gttttcatct gagagccagg 
actattatat aatacagaat 
caggattatt ttttatatct 
atgaagatct cattatatga 
agaaaataca ctctaaaatc 
tggtctccta gtaagaagtc 
acctcaacct ctgctgagat 
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tactttgtaa ccactttgat 
acacaaattt aaaaagaaag 
agtcaggaga tcgcttggtt 
gttcggtctg ctttgaaatg 
caggtataaa attttaatta 
ttttaataga aaacctcagg 
cttagaattg tccacctagt 
taacttagtg ctagctagcg 
gtgaaaccat ataatcacag 
ggcactcttg agggtgttgt 
ttcaaatagg ttatgctcca 
atgttggaag tggtatttag 
ttgaatagta taggggaggg 
atggacaaga gaaataattg 
tatgggacat tcgttctatt 
tttcctttat ttctacatct 
tgtcttacac tttaataaat 
gtagctgaat ttgttaaagt 
ggaaaatcat aggttaccat 
ttgatttgaa acatattaga 
accgacggta gcgtcatatg 
tgtgtgctag gaacagcctt 
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ccctccgttt cccctcagtc aaacttgagc cagcctctgg atcgatgtga tcttattgca 
2640 

tgtttccatg gggtgtacct atactttaag ccaatcctgc tgcattcact gctaagttaa 
2700 

ataaaaagcc aagaagaaaa aaaaaatttt gcactgtgca gatcctttgc tatctgactt 
2760 

gcatctcttc ccccacctgt cagctagcca cctgcttgtt tgtgttggga tattttttag 
2820 

cacctgaagc accatctgaa aggggcacca t 
2851 



<210> 23 

<211> 3473 

<212> DNA 

<213> human 

<400> 23 

aagagcagcg gcgaggcggc ggUggtggct gagtccgtgg tggcagaggc gaaggcgaca 
60 

gctctagggg ttggcaccgg ccccgagagg aggatgcggg tccggatagg gctgacgctg 
120 

ctgctgtgtg cggtgctgct gagcttggcc tcggcgtcct cggatgaaga aggcagccag 
180 

gatgaatcct tagattccaa gactactttg acatcagatg agtcagtaaa ggaccacact 
240 

actgcaggca gagtagttgc tggtcaaata tttcttgatt cagaagaatc tgaattagaa 
300 

tcctctattc aagaagagga agacagcctc aagagccaag agggggagag tgtcacagaa 
360 

gatatcagct ttctagagtc tccaaatcca gaaaacaagg actatgaaga gccaaagaaa 
420 

gtacggaaac cagctttgac cgccattgaa ggcacagcac atggggagcc ctgccacttc 
480 

ccttttcttt tcctagataa ggagtatgat gaatgtacat cagatgggag ggaagatggc 
540 

agactgtggt gtgctacaac ctatgactac aaagcagatg aaaagtgggg cttttgtgaa 
600 

actgaagaag aggctgctaa gagacggcag atgcaggaag cagaaatggt gtatcaaact 
660 

ggaatgaaaa tccttaatgg aagcaataag aaaagccaaa aaagagaagc atatcggtat 
720 

ctccaaaagg cagcaagcat gaaccatacc aaagccotgg agagagtgtc atatgctctt 
780 

ttatttggtg attacttgcc acagaatatc caggcagcga gagagatgtt .tgagaagctg 
840 

actgaggaag gctctcccaa gggacagact gctcttggct ttctgtatgc clctggactt 
900 
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ggtgttaatt caagtcaggc aaaggctctt gtatattata catttggagc tcttgggggc 
960 

aatctaatag cccacatggt tttgggttac agatacLggg ctggcatcgg cgtcctccag 
1020 

agttgtgaat ctgccctgac tcactatcgt cttgttgcca atcatgttgc tagtgatatc 
1080 

tcgctaacag gaggctcagt agtacagaga atacggctgc ctgatgaagt ggaaaatcca 
114 0 

ggaatgaaca gtggaatgct agaagaagat ttgattcaat attaceagtt cctagctgaa 
1200 

aaaggtgatg tacaagcaca ggttggtctt ggacaactgc acctgcacgg agggcgtgga 
1260 

gtagaacaga atcatcagag agcatttgac tacttcaatt tagcagcaaa tgctggcaat 
1320 

tcacatgcca tggccttttt gggaaagatg tattcggaag gaagtgacat tgtacctcag 
1380 

agtaatgaga cagctctcca ctactttaag aaagctgctg acatgggcaa cccagttgga 
1440 

cagagtgggc ttggaatqgc ctacctctat gggagaggag ttcaagttaa ttatgatcta 
1500 

gcccttaagt atttccagaa agctgctgaa caaggctggg tggatgggca gctacagctt 
1560 

ggttccatgt actataatgg cattggagtc aagagagatt ataaacaggc cttgaagtat 
1620 

tttaatttag cttctcaggg aggccatatc ttggctttct ataacctagc: tcagatgcat 
1680 

gccagtggca ccggcgtgat gcgatcatgt cacactgcag tggagttgtt taagaatgta 
1740 

tgtgaacgag gccgttggtc tgaaaggctt atgactgcct ataacagcta taaagatggc 
1800 

gattacaatg ctgcagtgat ccagtacctc ctcctggctg aacagggcta tgaagtggca 
1860 

caaagcaatg cagcctttat tcttgatcag agagaagcaa gcattgtagg tgagaatgaa 
1920 

acttatccca gagctttgct acattggaac agggccgcct ctcaaggcta tactgtggct 
1980 

agaattaagc tcggagacta ccatttctat gggtttggca ccgatgtaga ttatgaaact 
2040 

gcatttattc attaccgtct ggcttctgag cagcaacaca gtgcacaagc tatgtttaat 
2100 

ctgggatata tgcatgagaa aggactgggc attaaacagg atattcacct tgcgaaacgt 
2160 

ttttatgaca tggcagctga agccagccca gatgcacaag ttccagtctt cctagccctc 
2220 
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tgcaaattgg gcgtcgtcta tttcttgcag tacatacggg aaacaaacat tcgagatatg 
2280 

ttcacccaac ttgatatgga ccagcttttg ggacctgagt gggaccttta cctcatgacc 
2340 

atcattgcgc tgctgttggg aacagtcata gctLacaggc aaaggcagca ccaagacatg 
24 00 

cctgcaccca ggcctccagg gccacggcca gctccacccc agcaggaggg gccaccagag 
2460 

cagcagccac cacagtaata ggcactgggt ccagccttga tcagtgacag cgaaggaagt 
2520 

tatetgctgg gaacacttgc atttgattta ggaccttgga tcagtggtca cctcccagaa 
2580 

gaggcacggc acaaggaagc attgaattcc taaagctgct tagaatctga tgcctttatt 
2640 

ttcagggata agtaactctt acctaaactg agctgaatgt ttgtttcagt gccatatgga 
2700 

ataacaactt tcagtggctt ttttttttct tttctggaaa catatgtgag acactcagag 
2760 

taatgtctac tgtatccagc tatctttctt ggatcctttt ggtcattatt tcagtgtgca 
2820 

taagttctta atgtcaacca tctttaaggt attgtgcatc gacactaaaa actgatcagt 
2880 

gtaaaaagga aaacccagtt gcaagtttaa acgtgttcga aagtctgaaa atagaacttg 
2940 

ccttttaagt taaaaaaaaa aaaagctatc ttgaaaatgt tttggaactg cgataactga 
3000 

gaaactctta ccagtccaca tgcaattaga catattcagc atatttgtta ttttaaaagg 
3060 

gagggttggq aggtttctta ttggtgattg tcacacggta taccatactc ctctccttca 
3120 

aagaatgaaa ggccttgtta aggagttttt tgtgagcttt acttctttgg aatggaatat 
3180 

acttatgcaa aaccttgtga actgactcct tgcactaacg cgagtttgcc ccacctactc 
324 0 

tgtaatttgc ttgtttgttt tgaatataca gagccttgat ccagaagcca gaggatggac 
3300 

taagtgggag aaattagaaa acaaaacgaa ctctggttgg ggtactacga tcacagacac 
3360 

agacatactt ttcctaaagt tgaagcattt gttcccagga tttattttac tttgcatttc 
3420 

cttttgcaca aagaacacat caccatttcc ttttgcacaa agaacacatc acc 
3473 



<210> 24 
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<211> 
<212> 
<213> 



401 
DNA 
human 



5 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(252) . . (252) 
any kind of base 



10 



<220> 
<221> 
<222> 
<223> 



misc feature 



(303) , . (303) 
any kind of base 



15 



<220> 



<221> misc_feature 
<222> (390)'.. (390) 
<223> any kind of base 



<400> 24 

ttagattatt ttcaatttat tatfccagaat aaatatatct tttttcttta acttctcaaa 
60 

tagttattga attgtattgg tttaaattaa atgcgtcatg tgtatatatc agtattaatt 
120 

caagagatac aaaaggaaat tgagtgaaaa ataagtctgc ctccttccca tcactctcat 
180 

gtctctacct agaggcaatt attgtcaaca gtttttgatg tgtctttcaa aaaatagtcc 
240 

attaagcctg gngtactaga tctcttttaa aagtttacaa cctgttacag aatatatata 
300 

aangttcaat tactagtaac accttattac agatacagat tacaacttag gaaatatatt 
360 

ttcatggacc attgatgtca tttggattcn cccctacaat c 
401 



<210> 25 

<211> 1820 

<212> DNA 

<213> human 

<400> 25 

aatgtcttag aaaaaggctt tctaaaagaa aaagagcaag aggccatttc ttttcaagat 
60 

agatacaaag aacttcagga aaaacataaa caagaattgg aagacatgag gaaagctggt 
120 

cacgaagccc tcagcattat tgtggatgaa tataaggcac tactgcagtc ttcagttaag 
180 

caacaagtag aagctattga aaaacagtac atttctgcaa ttgagaaaca ggcacacaag 
24 0 

tgtgaggagt tgctaaatgc tcagcatcag aggctccttg aaatgctaga tacagagaag 
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300 
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gaactgttaa aagaaaaaat aaaggaagct ttgattcagc aatctcaaga acagaaggaa 
360 

atattggaaa agtgtttgga ggaagaaagg caaagaaata aagaggcatt agtatccgct 
420 

gcaaagcttg aaaaagaagc agtgaaggat gcagttttaa aagtcgtaga agaagaaaga 
480 

aaaaatttag aaaaagcgca tgctgaagaa agggaattat ggaagacaga acatgcaaaa 
540 

gatcaagaaa aagtatctca ggaaattcaa aaagctatac aagaacaaag aaaaataagt 
600 

caggaaactg Ltaaggcagc aataatagaa gagcagaaac gaagtgaaaa ggctgtggaa 
660 

gaggcagtga aaagaacaag agatgaattg atagagtata taaaagaaca gaaaaggctc 
720 

gatcaagtca tccgccaaag aagcctgtcc agtttggaac tgttcctctc ctgtgcacag 
780 

aaacagttaa gtgctttaat agctacggaa ccagttgaca ttgaataaaa agaacatgac 
840 

aaacccacac tggcattgga taaatcatat tacaccttca aaatacacac tctgaattat 
900 

aaagatgtgt ttgttttctt tccaaatcat gtagaattga tttccagttc aaggataaac 
960 

caaaacaata tttagaacta tcaagtgatc taatttattt tcttttggtt tcttctttac 
1020 

atttactgtt attttattat tattagtagt agcagcaaca gagtatgata tgacccaaaa 
1080 

gccattgtaa agtgccacat taccaaaatt aattaagtaa actttatagc ctgtgggagt 
1140 

ctattatata ttattttgca aaagtagtaa atatattatt gtttcatgat gactcttgat 
1200 

gagatgctag aatgtaacca tacatttatc ttattttgag gatagaaata gcatggattt 
1260 

caacatcact tatLtatctg tataattgga aataaaacac cgatatgata gagaatcatt 
1320 

ccggcattac ctaacctctt ctgcagttgg atctatgtat tttcattggt ctactgaaaa 
1380 

cgaacaatac aattaaaagc actaaagatt attatattaa ttcaactttg atctgatata 
1440 

tcacttaaac taaaggggtg tgtgtggtgt atgcttgttt cctatttctg ctctttaaag 
1500 

atactttgaa tcaataaaac cattagtcta caaatcaaat tgtgaactta atctctagaa 
1560 

agagaatata actcagccat ttataggaat ttaggttcaa gtacaggata tatgaaatct 
1620 
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tttcccagta tttcagaatg tacttaattc acaggcagga tgcttcaatg caaaatcatg 
1680 

aatattttta attcaaaact aaaatgtcat taatatgtat gtatgcaaat gttttatctt 
1740 

attttctgaa atgcatctac tttcatgggc tttgtacgtt tctgagattt ctcagtgtaa 
1800 

taaaaagagc tcccaaactt 
1820 



<210> 26 

<211> 280 

<212> DNA 

<213> human 

<220> 

<221> misc_f eature 

<222> (261) . . (261) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (237). . (237) 

<223> any kind of base 



<400> 26 

tcaagtcata agataaagtt taatcatttg atcatgttaa aagacacaaa acacagccaa 
60 

tctaaccaaa tttcaggcat gcatttacat aaatatatta aattaagaaa agaaattgta 
120 

cacttaaacg tccttttcac ctagaaatca ttaaatccac agatcaacaa taaaaccaat 
180 

tctctgcatt taccacttca agatacaatt gttctatttt aaagataaca caaactncac 
240 

tagtctggtt aggaatttat ntgcattata catatattat 
280 



<210> 27 

<211> 392 

<212> DNA 

<213> human 

<400> 27 

ttggtttgaa atggcacccc aggactttgg gcctgcctta cttgatagcc tcgttcagtg 
60 

agcaaagact tagtgagcag ctcttgtatg ccaagtattt tgctaagctc tggaaaaaag 
120 

ataaacaaga catggttctt gctttcaagg agtgtgtaat tctttagcca gatatggaaa 
180 

cctggaccct gagtgggaga aaggagacag atgaaaggag tccgtgattt tgtaaccaag 
240 

agctgcctgc atggttatga gtatcactga ttttagggac gcccacagag ctaaagcatt 
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300 

tttttaatcc gagaagactt ttgtaactca tattagttaa tcttctagct ctgagatagc 
360 

aacacagctc ttagaattct gtaagtaagc tt 
392 



<210> 28 

<211> 2299 

<21.2> DNA 

<213> human 

<40O> 28 

cgaaccccca cagctggagg gcgaggccag ctgtacccgg ccccagtgcc ctttcgcggc 
60 

cacaagcggc cgtcctcctg gtccggtgct ccggcgcctg atctaggttc atggagccgg 
120 

ggctgtggct ccttttcggg cLcacagtga cctccgccgc aggattcgtg ccttgctccc 
i80 

agtctgggga tgctggcagg cgcggcgtgt cccaggcccc cactgcagcc agatctgagg 
240 

gggactgtga agagactgtg gctggccctg gcgaggagac tgtggctggc cctggcgagg 
300 

ggactgtggc cccgacagca ctgcagggtc caagccctgg aagccctggg caggagcagg 
360 

cggccgaggg ggccccLgag caccaccgat ccaggcgctg cacgtgcttc acctacaagg 
420 

acaaggagtg tgtctactat tgccacctgg acatcatttg gatcaacact cccgaacaga 
480 

cggtgcccta tggactgtcc aactacagag gaagcttccg gggcaagagg tctgcggggc 
540 

cacttccagg gaatctgcag ctctcacalc ggccacactt gcgctgcgct tgtgtgggga 
600 

gatatgacaa ggcctgcctg cacttttgca cccaaactct ggacgtcagc agacaggttg 
660 

aagtcaagga ccaacaaagc aagcaggctt tagacctcca ccatccaaag ctcatgcccg 
720 

gcagtggact cgccctcgct ccatctacct gcccccgctg cctctttcag gaaggagccc 
780 

cttaggagga caggcctgca gcatcctggt ctcgggaggc ttctgtcatt gctcacacac 
840 

agttcagatt tccacctctt tatagacaag aagtgaattt gcctggggca gaacacccac 
900 

ccaaagagtc cccacttaac aatacccccc ccccacggca agaatgccca aatccgaatg 
960 

accccagttt tcctaatgag taaaatgatc ccagatgtgc cccagagcat gacgcctgca 
1020 
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gctccggttt catgcaggaa 
1080 

ctggcttttg acatgacttc 
1140 

tgtaaacaca tgtccatctt 
1200 

tttcatatcc ttgcctagaa 
1260 

ggctttagac acagatcata 
1320 

ggcagagaaa ttttcagctg 
1380 

aggaagaaga aaaaaggatc 
1440 

caatacagag cttgttcctg 
1500 

cgatacacag tggagttccc 
1560 

gctcaaggct attaggttga 
1620 

gcttcaaaat aagtcacgaa 
1680 



aattggcaac aacttatacc 
1740 



ccgagccgag cttactgtga 
1800 

ggggagggct gcccatctcc 
1860 



ggtgtccagg gccccgtaga 
1920 

ccaaatgtta aatcctctgt 
1980 

tatcatgagg aaatgaaagt 
204 0 

gattatttat tgtgaaactg 
2100 

gtcactgtat atacgtatag 
2160 



tacaaactca tactccttag 
2220 

caccgtggca agatggtatc 
2280 



tgtcaaaagc ctaataaaa 
2299 
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attggttttg gagagttttg 
tcttggagaa taagtggact 
gtaataaatg caaaatgccc 
taggctgcat ggtgtatgtc 
gctctacagg agtttatgaa 
tgcttgatac ccaccaaaag 
cttgatgttt gtgacaagaa 
ttcagtgact gaccctctgt 
aggccttgtt tgcaggaagc 
atatttgctt tcatgagtaa 
cacaaattct ttgtaaatta 
gtctgacagt tcaaaatctc 
gtgtggagat gttatcccac 
ccaacccagt cacagagaga 
gagacattta agatggtgta 
gtgtatttca laagttatta 
ggctgatttg ctggtaggat 
ttctccactc caactccttt 
agaggtagat aggtaggtag 
agcttgaatt acatttttaa 
agagagaaac ccatcaattg 



71 



gcaagttgga aagccactta 
ccaagctaac tctttgcaaa 
gtgcagcaga agcatgcgac 
agtgagggcc acgaggcgtc 
tttgaagctt atgggatttt 
aatgtatctc gaaagaatga 
aatgagaaag ttagtatctg 
attctgtata gacaccaggc 
cgactgtaaa gacagcccca 
atgtggatct ttggggaatg 
tgtaaattcc tgtttatala 
tttcagctgc gctcttccca 
catgtaaagt cgcctgcgca 
taggaaacgg catttgagtg 
tgacagagca ttggccttga 
caggtataaa agtgatgacc 
tttgtacagt ttagagaagc 
atgtggatct gttcaaagta 
attttaaatt gcattctgaa 
aatgcatatg tgctgtttgg 
ctcaaafcact cagaaagtac 
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<210> 29 

<211> 1339 

<212> DNA 

<213> human 

<400> 29 

ctaaacaaaa tcattcactt ccctgatttt 
60 

ctgcctctcc tctacgttgg aaaccacata 
120 

ctaccgatgt tcaccgtgct caggaaattc 
180 

atcatacttg ggaagcagta ttcactcaac 
240 

ggggctttca tagcagctgg gtctgacctt 
300 

ttcctgaatg atatcttcac atcagcaaat 
360 

aaggagctag ggaaatacgg agtacttttc 
420 

cttattatta gtgtctccac tggagacctc 
480 

aatgttgtgt ttatcctaca gtttcttctt 
540 

tccacggttc tgtgcagcta ttacaattca 
600 

aagaatgtat ccgttgccta cattgggata 
660 

ttaaactttg tagggttaaa tatttgcatg 
720 

ctgagcagcc agttaaaacc taaacctgtg 
780 

agctaaagag tctgcagcag gattggagac 
840 

cccagtagga atgtgaagcc agaggtttcg 
900 

agagcatctg caaaatgcaa agagaactac 
960 

caagaaatgt actcgggcga caccttacct 
1020 

ctgggactcg gtaggtggag. ccccagctgc 
1080 

atctctgtgc tgtgtgctgg ggaggaggtt 
1140 

ccttggccgt gattcccaag gtcccagcca 
1200 



gataagaaaa ttcctgtaaa gctgtttcct 
agtggattat caagcacaag taaattaagc 
accattccac ttaccttact tctggaaacc 
atcatcctca gtgtctttgc cattattctc 
gcttttaact Lagaaggcta tatttttgta 
ggagtttata ccaaacagaa aatggaccca 
tacaatgcct gcttcatgat tatcccaact 
caacaggcta ctgaattcaa ccaatggaag 
tcctgLtttt tggggtttct gctgatgtac 
gccctgacga cagcagtggt tggagccatc 
ttaatcggtg gagactacat tttctctttg 
gcagggggct tgagatattc ctttLLaaca 
ggtgaagaaa acatctgttt ggatttgaag 
tgacttgtga ctgcgggctg ggggggcatt 
gattcgtgac atccaccccc tgggcaagtg 
ctcatatgca ggatgagcca atggcaglcL 
gtggaaagca aatcttttca aaataagcca 
tcttctaggg acctatgggg ccttcgtggc 
gatgtaatgg tgactctttt ctgatcagca 
aagcaaaggg ccagttgttt cagtttaaac 
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agacatgtct ttagtctaat aaaattagtt aactgccagt aaagttattt gttagcttlg 
1260 

atgaaagcta tgttggtatc tttccctaat catcaaagta aataaaaaat catttctatg 
1320 

taaaaaaaaa aaaaaaaaa 
1339 



<210> 30 

<211> 4250 

<212> DNA 

<213> human 

<400> 30 

gaacacatcg cgtttgcatc ccagaaagta gtcgccgcga ctatttcccc caaagagaca 
60 

agcacacatg taggaatgac aaaggcttgc gaaggagaga gcgcagcccg cggcccggag 
120 

agatcccctc gataatggat tactaaatgg gatacacgct gtaccagttc gctccgagcc 
180 

ccggccgcct gtccgtcgat gcaccgaaaa gggtgaagta gagaaataaa gtctccccgc 
240 

tgaactacta tgaggtcaga agccttgctg ctatatttca cactgctaca ctttgctggg 
300 

gctggtttcc cagaagattc tgagccaatc agtatttcgc atggcaacta tacaaaacag 
360 

tatccggtgt ttgtgggcca caagccagga cggaacacca cacagaggca caggctggac 
420 

atccagatga ttatgatcat gaacggaacc ctctacattq ctgctaggga ccatatttat 
480 

actgttgata tagacacatc acacacggaa gaaatttatt gtagcaaaaa actgacatgg 
540 

aaatctagac aggccgatgt agacacatgc agaatgaagg gaaaacataa ggatgagtgc 
600 

cacaacttta ttaaagttct tctaaagaaa aacgatgatg cattgtttgt ctgtggaact 
660 

aatgccttca acccttcctg cagaaactat aagatggata cattggaacc attcggggat 
720 

gaattcagcg gaatggccag atgcccatat gatgccaaac atgccaacgl tgcactgttt 
780 

gcagatggaa aactatactc agccacagtg actgacttcc ttgccattga cgcagtcatt 
840 

taccggagtc ttggagaaag ccctaccctg cggaccgtca agcacgattc aaaatggttg 
900 

aaagaaccat actttgttca agccgtggat tacggagatt atatctactt cttcttcagg 
960 

gaaatagcag tggagtataa caccatggga aaggtagttt tcccaagagt ggctcaggtt 
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1020 

tgtaagaatg atatgggagg atctcaaaga gtcctggaga aacagtggac gtcgttcctg 
1080 

aaggcgcgct tgaoctgctc agttcctgga gactctcatt tttatttcaa cattctccag 
1140 

gcagttacag atgtgattcg tatcaacggg cgtgatgttg tcctggcaac gttttctaca 
1200 

ccttataaca gcatccctgg gtctgcagtc tgtgcctatg acatgcttga cattgccagt 
1260 

gtttttactg ggagattcaa ggaacagaag tctcctgatt ccacctggac accagttcct 
1320 

gatgaacgag ttcctaagcc cagqccaggt Lgctgtgctg gctcatcctc cttagaaaga 
1380 

tatgcaacct ccaatgagtt ccctgatgat accctgaact tcatcaagac gcacccgctc 
1440 

atggatgagg cagtgccctc catcttcaac aggccatggt tcctgagaac aatggtcaga 
1500 

taccgcctta ccaaaattgc agtggacaca gctgctgggc catatcagaa tcacactgtg 
1560 

gtttttctgg gatcagagaa gggaatcatc ttgaagtttt tggccagaat aggaaatagt 
1620 

ggttttctaa atgacagcct tttcctggag gagatgagtg tttacaactc tgaaaaatgc 
1680 

agctatgatg gagtcgaaga caaaaggatc atgggcatgc agctggacag agcaagcagc 
1740 

tctctgtatg ttgcgttctc tacctgtgtg ataaaggttc cccLtggccg gtgtgaacga 
1800 

catgggaagt gtaaaaaaac ctgtattgcc tccagagacc catattgtgg atggataaag 
1860 

gaaggtggtg cctgcagcca tttatcaccc aacagcagac tgacttttga gcaggacata 
1920 

gagcgtggca atacagatgg tctgggggac tgtcacaatt cctttgtggc actgaatgac 
1980 

atttcaactc ctctaccaga taatgaaatg tcttacaaca cagtgtatgg gcattccagt 
2040 

tccctcttgc ccagcacaac cacatcagat tcgacggctc aagaggggta tgagtctagg 
2100 

ggaggaatgc tggactggaa gcatctgctt gactcacctg acagcacaga ccctttgggg 
2160 

gcagtgtctt cccataatca ccaagacaag aagggagtga ttcgggaaag ttacctcaaa 
2220 

ggccacgacc agctggCtcc cgtcaccctc ttggccattg cagtcatcct ggctttcgtc 
2280 

atgggggccg tcttctcggg catcaccgtc tactgcgtct gtgatcatcg gcgcaaagac 
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2340 



gtggctgtgg tgcagcgcaa ggagaaggag ctcacccact cgcgccgggg cfcccatgagc 
2400 



agcgtcacca agctcagcgg cctctttggg gacactcaat ccaaagaccc aaagccggag 
2460 



gccatcctca cgccactcal gcacaacggc aagctcgcca ctcccggcaa cacggccaag 
2520 



atgctcatta aagcagacca gcaccacctg gacctgacgg ccctccccac cccagaghca 
2580 



accccaacgc tgcagcagaa gcggaagccc agccgcggca gccgcgagtg ggagaggaac 
2640 



cagaacctca tcaatgcctg cacaaaggac atgcccccca tgggctcccc tgtgattccc 
2700 



acggacctgc ccctgcgggc ctcccccagc cacatcccca gcgtggtggt cctgcccatc 
2760 



acgcagcagg gctaccagca tgagtacgtg gaccagccca aaatgagcga ggtggcccag 
2820 

atggcgctgg aggaccaggc cgccacactg gagtataaga ccatcaagga acatctcagc 
2880 



agcaagagtc ccaaccatgg ggtgaacctt gtggagaacc tggacagcct gccccccaaa 
2940 



gttccacagc gggaggcctc cctgggtccc ccgggagcct ccctgtctca gaccggtcta 
3000 



agcaagcggc tggaaatgca ccactcctct tcctacgggg ttgactataa gaggagctac 
3060 



cccacgaact cgctcacgag aagccaccag gccaccactc tcaaaagaaa caacactaac 
3120 

tcctccaatt cctctcacct ctccagaaac cagagctttg gcaggggaga caacccgccg 
3180 

cccgccccgc agagggtgga ctccatccag gtgcacagct cccagccatc tggccaggcc 
324 0 



gtgactgtct cgaggcagcc cagcctcaac gcctacaact cactgacaag gtcggggctg 
3300 



aagcgtacgc cctcgctaaa gccggacgta ccccccaaac catcctttgc tcccctttcc 
3360 

acatccatga agcccaatga tgcgtgtaca taatcccagg gggagggggt caggtgtcga 
3420 



accagcaggc aaggcgaggt gcccgctcag ctcagcaagg ttctcaactg cctcgagtac 
3480 

ccaccagacc aagaaggcct gcggcagagc cgaggacgct gggtcctcct ctctgggaca 
3540 

caggggtact cacgaaaact gggccgcgtg gtttggtgaa ggtttgcaac ggcggggact 
3600 



cacct tcatt ctcttccttc actttccccc acaccctaca acaggtcgga cccacaaaag 
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3660 

acttcagtta tcatcacaaa catgagccaa aagcacatac ctaccccatc ccccaccccc 
3720 

acacacacac acacatgcac acaacacata cacacacacg cacagaggtg aacagaaact 
3780 

gaaacatttt gtccacaact tcacgggacg tggccagact gggtttgcgt tccaacctgc 
3840 

aaaacacaaa tacatttttt aaaatcaaga aaatttaaaa agacaaaaaa aaaagaattc 
3900 

attgataatt ctaactcaga ctttaacaat ggcagaagtt tactatgcgc aaaLactgtg 
3960 

aaatgcccgc cagtgttaca gctttctgtt gcagcagata aatgccatgt tgggcaacta 
4020 

tgtcatagat ttctgctcct cctctctttt aatgaaataa cgtgaccgtt aacgcaagta 
4080 

actctttatt tattgttcac cctttttttc cttaaggaaa ggactcttcc aaatatcatc 
4140 

ctatgaacag ctcttcagaa agcccattga aagttaaact atttaacgLg aaatccatta 
4200 

actggaataa ttgagtttct ttatttttac aataaattca ctgagtaaat 
4250 



<210> 31 

<211> 2705 

<212> DMA 

<213> human 

<400> 31 

ctttagccca acagtcaaaa ataattgatg ctaccctaca aatgtccaaa actctagtat 
60 

atcatatttc taagttacag caaatattag tcctgctaaa ccagggagct ttggcaaaaa 
120 

tgttttttga cagtaaattt gtccttgatt atatattaac tagtcaaaga ggtgtttgta 
180 

acattattag agcttcttgt tgtaggtggg ttaacaccac caatcaagag gtcattctaa 
240 

cagaaagcct ggatcagaaa accatcaccc taaaaaaaca tgccttacat atttaacaca 
300 

ctctgaaatc cagtcaaaat atgactaaag gcccttgcca tgactgatgt attctcctgg 
360 

ccaacgccaa acaaatggga gcctggttac gagtcagcct tcagggactt gtcacatttc 
420 

tacttggttt cttccttgtt attgtcataa taaaatgttt tctatgctgt ttagtgcaac 
480 

ttaggcccta ttctgtagaa gtctcctcta ctattcaggc cactcaaaca ccccaaataa 
540 
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ttgagttcaa aatzcgacatc 
600 

ggtattttta ttgattattg 
660 

gtcagtttcc tctcttgagc 
720 

ttctcacact ctggggccac 
780 



aattaaggac agcgcccatg 
840 

ttcaacctag ctaaccccac 
900 

tgtggtaccc agtcctcagg 
960 

ttcagagctg ttttccacag 
1020 

gagaaaaaga aaactcagta 
1080 



aatggtcctg aacaatggag 
1140 



gttgatatta aaaccagtga 
1200 



tacaacccag ccacagccaa 
1260 



gaggacaacg ataaccgatc 
1320 



tttcagtttc attttcactg 
1380 



ggagtcaaat attctgccga 
1440 



cttgctgaag ctgcctcaaa 
1500 



ggtgaggcca acccaaagct 
1560 



ggcaaacgag ccccattcac 
1620 

ttctggacct accctggctc 
1680 



atctgtaagg agagcatcag 
1740 



tcaaatgttg aaggtgataa 
1800 

ctgaagggca gaacagtgag 
1860 
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aagatataaa ggaatcagtg 
tgctgtcttg acctagtatg 
agctgattaa atccacaccc 
tatgtaccca ctctaatcac 
ccccaaagcc cgccaaaatt 
cctttttgct gtacataagc 
ugcaaccccc Lgcgtggtcc 
aggtagtgaa aagaactgga 
gaagataatg gcaagtccag 
caagctgtat cccattgcca 
aaccaaacat gacacctctc 
agaaattatc aatgtggggc 
agtgctgaaa ggtggtcctt 
gggcagtaca aatgagcatg 
gcttcacgta gctcactgga 
ggctgatggt ttggcagtta 
gcagaaagta cttgatgccc 
aaattttgac ccctctactc 
tctgactcat cctcctctUt 
tgtcagctca gagcagctgg 
cgctgtcccc atgcagcaca 
agcttcattt tgatgattct 



actaaatata tttcatatat 
gaggccttgg ctagaggctg 
caaccacttc ccttatcagg 
cacagggcca gacatcagac 
atgcaaatta ttcaaaatta 
tgcccattcc ccctccagcc 
tctgtggcag ccttctctca 
ttttcaagtt cactttgcaa 
actggggata tgatgacaaa 
atggaaataa ccaatcccct 
tgaaacctat tagtgtctcc 
attctttcca tgtaaatttt 
tctctgacag ctacaggcCc 
gttcagaaca tacagtggat 
attctgcaaa gtactccagc 
ttggtgtttt gatgaaggtt 
tccaagcaat taaaaccaag 
tccttccttc atccctggat 
atgagagtgt aacttggatc 
cacaattccg cagccttcta 
acaaccgccc aacccaacct 
gagaagaaac ttgtccttcc 
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tcaagaacac agccctgctt ctgacataat ccagttaaaa taataatttt taagaaataa 
1920 

atttatttca atattagcaa gacagcatgc cttcaaatca atctgtaaaa ctaagaaact 
1980 

taaattttag ttcttactgc ttaattcaaa taataattag taagctagca aatagtaatc 
2040 

tgtaagcata agcttatctt aaattcaagt ttagtttgag gaattctt ta aaattacaac 
2100 

taagtgattt gtatgtctat ttttttcagt ttattLgaac caataaaata attttatctc 
2160 

tttctttctg ttgtgcattc agtttctaaa accattaagt ttctactcca tttacattca 
2220 

aaaatcttaa atactttact tgcaagagta ttttgcttca aatacaacaa cctaagagca 
2280 

gctggagatg aaatattggg aaattcattt gcttactcct gaagacaaaa atatagctga 
2340 

gatgaccact ggatttaata tcgttatgcl ggcccaacat tgctaccatt tgtgttgtct 
2400 

gtgatcaaaa tgattatctt ttatatagga agatgacgct tctggatatt gctttcactt 
2460 

cttctcccca cgttagcaag gacaatgctt ctctgccatt attacaacta gttagttlgc 
2520 

atggagaatc tttactttaa aattggaaga aaagtcacaa gtgaatggtt tataaaaatg 
2580 

ctaaagaagt cattcttgct tagaatcata tagaaacatc atgcaatctt ttagtcagat 
2640 

gtgcgcttca ccttatgcta tttttatctt taattgacac acaataattg tacatgttta 
2700 

tggagtatag tgtggtgttt tctgtttgtt tgtttgtttt ttgagacaag gtctcactct 
2760 

gccagtcagg gtggagtgcg atggt 
2785 



<210> 32 

<211> 9588 

<212> DNA 

<213> human 

<400> 32 

ccgaccaaca ccaacaccca gctccgacgc agctcctctg cgcccttgcc gccctccgag 
60 

ccacagcttt cctcccgctc ctgcccccgg cccgtcgccg tctccgcgct cgcagcggcc 
120 

tcgggagggc ccaggtagcg agcagcgacc tcgcgagcct tccgcactcc egcccggttc 
180 

cccggccgtc cgcctatcct tggccccctc cgctttctcc gcgccggccc gcctcgctta 
24 0 
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tgcctcggcg ctgagccgct ctcccgattg 
300 



cacccgcgga tcaacactct gggccgcatg 
360 



tacgaggtga ccagcggcgg cgggggcacc 
420 



atcaccgacc agaactcgga cggctactgt 
480 



cagaacacca tccaggagct gctgcagaac 
540 



atcgtgcagc ctgaattgaa gtatggagat 
600 



gatgagtgtt ttgcccaggc caatgaccaa 
660 



atgcggcaga tgggccagcc ctgtgatgct 
720 



caaatgcgag ccctttataa agccatcagt 
780 



ggtggtggag gctacacttg tcagagtggc 
840 



accagtgaat gtttggggtg gatgaggcag 
900 



ggtgtggacc tggcctcagt ggagcagcac 
960 



atcggcgact atcgctggca gctggacaaa 
1020 



atctaccagt tggaggagga gtatgaaaac 
1080 



cacctgcgac agctgcagaa catcattcag 
1140 



gactgcgagg aggaggagct gctgtacgac 
1200 



aaacaggagg ccttctccat acgcatgagt 
1260 



aagctgaaac aagaaagtga ccaacttgtc 
1320 



gaggcctata tggacactct gcagacgcag 
1380 



attgatgttc atctgaaaga aaatgctgcc 
1440 



actgaagcat acctgaaggg gctccaggac 
1500 



aacatgcccc tgcagcacct gctggaacag 
1560 



cccgccgaca tgagctgcaa cggaggctcc 
atccgcgccg agtctggccc ggacctgcgc 
agcaggatgt actattctcg gcgcggcgtg 
caaaccggca cgatgtccag gcaccagaac 
tgctccgact gcttgatgcg agcagagctc 
ggaatacaac tgactcggag tcgagaattg 
atggaaatcc tcgacagctt gatcagagag 
taccagaaaa ggcttcttca gctccaagag 
gtccctcgag Uccgcagggc cagctccaag 
tctggctggg atgagttcac caaacatgtc 
caaagggcgg agatggacat ggtggcctgg 
attaacagcc accggggcat ccacaactcc 
atcaaagccg acctgcgcga gaaatctgcg 
ctgctgaaag cgtcctttga gaggatggat 
gccacgtcca gggagatcat gtggatcaat 
tggagcgaca agaacaccaa catcgctcag 
caactggaag ttaaagaaaa agagctcaaL 
ctcaatoagc atccagcttc agacaaaatt 
tggagttgga ttcttcagat caccaagtgc 
tactttcagt tttttgaaga ggcgcagtct 
tccatcagga agaagtaccc ctgcgacaag 
atcaaggagc tggagaaaga acgagagaaa 
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atccttgaat acaagcgLca ggtgcagaac 
1620 

ctgaagcctc gtaacccaga ctacagaagc 
1680 



gactacaaac aagatcagaa aatcgtgcat 
1740 



aacgagcgca gcaagtggta cgtgacgggc 
1800 



gtggggctga tcatccctcc tccgaaccca 
1860 



cagtactacg aagccatctt ggctctgtgg 
1920 



gtgtcctggc actactgcat gattgacata 
1980 



ctgaaaacaa tgcggcagga agattacatg 
2040 



caagagttca tcagaaatag ccaaggctca 
2100 



atacagtctc agttcaccga tgcccagaag 
2160 



ggctatcccc agcaccagac agtgaccaca 
2220 

gatgtcaacc ataataaagt aattgaaacc 
2280 



atgctgatgg agctgcagaa gattcgcagg 
2340 



ctcaaaaacc tccctctagc agaccagggg 
2400 

gagcttaaga gtgtgcagaa Lgattcacaa 
24 60 



gatatgcttg ccaacttcag aggttctgaa 
2520 



ggactatttc agaaactgga aaatatcaat 
2580 



tgcacagtaa gggcactgct ccaggctatt 
2640 



gaagccaggc tcactgagga ggaaactgtc 
2700 



cgctgtggac tgaagaaaaU aaaaaatgac 
2760 



atgaagacag aactacagaa agcccagcag 
2820 



ctttatgatc tggacttggg caagttcggt 
2880 



ttggtaaaca agtctaagaa gattgtacay 
aataaaccca ttattctcag agctetctgt 
aagggggatg agtgtatcct gaaggacaac 
ccgggaggcg ttgacatgct tgttccctct 
ctggccgtgg acctctcttg caagattgag 
aaccagctct acatcaacat gaagagcctg 
gagaagatca gggccatgac aatcgccaag 
aagacgatag ccgaccttga gttacattac 
gagatgtttg gagatgatga caagcggaaa 
cattaccaga ccctggtcat tcagctccct 
actgaaatca ctcatcatgg aacctgccaa 
aacagagaaa atgacaagca agaaacatgg 
cagatagagc actgcgaggg caggatgact 
tcttctcacc acatcacagt gaaaattaac 
gcaattgctg aggttctcaa ccagcttaaa 
aagtactgct atttacagaa tgaagtattt 
ggtgttacag atggctactt aaatagctta 
ctccaaacag aagacatgtt aaaggtttat 
tgcctggacc tggataaagt ggaagcttac 
ttgaacttga agaagtcgtt gttggccact 
atccactctc agacttcaca gcagtaticca 
gaaaaagtca cacagctgac agaccgctgg 
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caaaggafcag ataaacagat 
2940 

ttgaggaatt atcgtgataa 
3000 

cgccaggatt ccttagaatc 
3060 

aatgagcaga agaacttgca 
3120 

caaaaaattg ctgaactttg 
3180 

tacacctcag gactggaaac 
3240 

ccttctgggg tgattctgca 
3300 

acaagatctg gagactatta 
3360 

aagctgaaaa ataccaagat 
3420 

aactcggaaa actgtaataa 
3480 

gagtgtlccc agttcaaagc 
3540 

ctggatggga agtcggctaa 
3600 

aatgagaaga tcacccgact 
3660 

gtggaagaca gatttgacca 
3720 

tgtgaaaagg agaaccttgg 
3780 

gagtacgaga ttgaaaggtt 
3840 

tatgaaaatg agctggcaaa 
3900 

aacaagtatg aaacagagat 
3960 

aaagaggatg attccaaaaa 
4020 

gatctgaagg atgaaattgt 
4080 

aggcgagctg aagaaaacgc 
4140 



aagcagcatc tggagataga 
4200 
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cgactttaga ttatgggacc 
ctatcaggct Itctgcaagt 
catgaaattt ggagattcca 
cagtgaaata tctggcaaac 
cgccaattca attaaggatt 
tctgctgaac atacctatca 
agaggctgca gatgttcatg 
caggttctta agtgagatgc 
cgaagttttg gaagaggagc 
gaacaaattc ctggatcaga 
gaagcttgcg agcctggagg 
gcaaaatcta gacaagtgct 
gacttalgag attgaagatg 
acagaagaat gactatgacc 
ttggcagaaa ttagagtctg 
gagggttcta ctgcaggaag 
ggtaagaaac cactataatg 
taacattacg aagaccacca 
tcttagaaac cagcttgata 
caggctcaat gacagcatct 
ccttcagcaa aaggcctgtg 
actgaagcag gtcatgcagc 



tggagaaaca aatcaagcaa 
ggctctatga tcgtaaacgc 
acacagtcat gcggtttttg 
gagacaaatc agaggaagta 
atgagctoca gctggcctca 
agaggaccat gattcagtcc 
ctcggtacat tgaactactt 
tgaagagttt ggaagatctg 
tcagactggc ccgagatgcc 
acctgcagaa ataccaggca 
agctgaagag acaggctgag 
acggccaaat aaaagaactc 
aaaagagaag aagaaaatct 
aactgcagaa agcaaggcaa 
agaaagccat caaggagaag 
aaggcacccg gaagagagaa 
aggagatgag taatttaagg 
tcaaggagat atccatgcaa 
gactttcaag ggaaaatcga 
tgcaggccac tgagcagcga 
gctctgagat aatgcagaag 
agcgctctga ggacaatgcc 
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cggcacaagc agtccctgga ggaggctgcc aagaccattc aqgacaaaaa taaggagatc 
4260 

gagagactca aagctyagtt tcaggaggag gccaagcgcc gctgggaata tgaaaaLgaa 
4 320 



ctgagtaagg taagaaacaa ttatgatgag gagatcatta gcttaaaaaa tcagtttgag 
4 380 



accgagatca acatcaccaa gaccaccatc caccagctca ccatgcagaa ggaagaggat 
4440 



accagtggct accgggctca gatagacaat ctcacccgag aaaacaggag cttatctgaa 
4500 



gaaataaaga ggctgaagaa cactctaacc cagaccacag agaatctcag gagggtggaa 
4560 



gaagacatcc aacagcaaaa ggccactggc tctgaggtgt ctcagaggaa acagcagctg 
4 620 



gaggttgagc tgagacaagt cactcagatg cgaacagagg agagcgtaag atataagcaa 
4 680 

tctcttgatg atgctgccaa aaccatccag gataaaaaca aggagataga aaggttaaaa 
4740 

caactgatcg acaaagaaac aaatgaccgg aaatgcctgg aagatgaaaa cgcgagatta 
4800 



caaagggtcc agtatgacct gcagaaagca aacagtagtg cgacggagac aataaacaaa 
4860 



ctgaaggttc aggagcaaga actgacacgc ctgaggatcg actatgaaag ggtttcccag 
4920 



gagaggactg tgaaggacca ggatatcacg cggttccaga aclctctgaa agagctgcag 
4980 



ctgcagaagc agaaggtgga agaggagctg aatcggctga agaggaccgc gtcagaagac 
5040 



tcctgcaaga ggaagaagct ggaggaagag ctggaaggca tgaggaggtc gctgaaggag 
5100 



caagccatca aaatcaccaa cctgacccag cagctggagc aggcatccat tgttaagaag 
5160 



aggagtgagg atgacctccg gcagcagagg gacgtgctgg atggccacct gagggaaaag 



cagaggaccc aggaagagct gaggaggctc tcttctgagg tcgaggccct gaggcggcag 
5280 



ttactccagg aacaggaaag tgtcaaacaa gctcacttga ggaatgagca tttccagaag 
5340 



gcgatagaag ataaaagcag aagcttaaat gaaagcaaaa tagaaattga gaggctgcag 
54 00 

tctctcacag agaacctgac caaggagcac ttgatgttag aagaagaact gcggaacctg 
54 60 



aggctggagt acgatgacct gaggagagga cgaagcgaag cggacagtga taaaaatgca 
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accatcttgg aactaaggag 
5580 

gggctgatta atgatttaca 
5640 



caaaagcagg ctttagaggc 
5700 



gtggtacagg aaagagagag 
5760 

aggclgcaga ggctggagga 
5820 

agggtgaaac agcgcctgga 
5880 

aagactcaat attcccgcaa 
5940 



agtgagagag agaagaacag 
6000 

agaattgaag agaggtgcag 
6060 

ttagaaacag aacgctcccg 
6120 

gggtcccatc gagagaccca 
6180 

tttgatgggc tgaggaagaa 
6240 

aaaacaacct tggacaaact 
6300 

atccagccat tccttcgggg 
6360 

aaatactctt tggtagaggc 
6420 

cttctggagg cccaggcagc 
6480 

actgtcgaca gtgccatagc 
6540 

gcagcagaaa aagctatcac 
6600 

tcagaagcca tcaagaaaaa 
6660 

cagattgctt cagggggtgt 
6720 

gccttggccc gggggctgat 
6780 

agtcagaaaa actttgtgga 
6840 



EP 1 355 149 A2 

ccagctgcag atcagcaaca 
gagagagagg gaaaatttga 
atctaatagg attcaggaat 
ccttctggtg aaaatcaaag 
tgagctgaat cgtgcaaaat 
gtgtgagaaa cagcaaattc 
ggaggaggct attaggaaga 
tcttaggagt gagatcgaaa 
gcgtaagctg gaggattcta 
atatcagagg gagattgata 
gactgagtgt gagtggaccg 
ggtgacagca atgcagctct 
attgaagggg aagaagtcag 
tgcaggatct atcgctggag 
caagagaaag aaattaatca 
tacaggtggt ataattgatc 
tcgggacctc attgacttcg 
tggttttgat gatccatttt 
tttgattgat agagaaaccg 
agtagaccct gtgaacagtg 
tgatagagat ttgtatcgat 
tccagtcacc aaaaagaagg 



accggaccct ggaactgcag 
gacaggaaat tgagaaattc 
caaagaatca gtgtactcag 
tcctggagca agacaaggca 
caactctaga ggcagaaacc 
agaatgacct gaatcagtgg 
tagaatcgga aagagaaaag 
gactccaagc agagatcaag 
ccagggagac acagtcacag 
aactcagaca gcgcccatat 
ttgacacctc caagctggtg 
atgagtgtca gctgatcgac 
tggaagaagt tgcttctgaa 
catctgcttc tcctaaggaa 
gcccagaatc cacagtcatg 
cccatcggaa tgagaagctg 
atgaccgtca gcagatatat 
caggcaagac agtatctgtt 
gaatgcgcct gctggaagcc 
tctttttgcc aaaagatgtc 
ccctgaatga tccccgagaL 
tcagttacgt gcagctgaag 
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gaacggtgca gaatcgaacc acatactggt 
6900 

atgtccttcc aaggaatcag acaacctgtg 
6960 

ttgagaccgt ccactgtcaa tgaactggaa 
7020 

gagagaatta aggacttcct ccagggttca 
7000 



acaaaacaga agcttggcat ttatgaggcc 
7140 



gctctggagL tgctggaagc ccaagcagct 
7200 

ttgaggttac cagtggagga agcctacaag 
7260 

aagctcctgt ctgcagaacg agctgtcact 
7320 



atctctttgt tccaagccat gaataaggaa 
7380 

ttagaagcac agatcgcaac cggggggatc 
7440 

gttgacatag catataagag gggctatttc 
7500 



ccaagtgatg ataccaaagg attttttgac 
7560 

caactaaaag aaagatgcat taaggatgag 
7620 



gaaaagaaga aacaggtgca gacatcacaa 
7680 



atagttgacc cagaaaccaa taaagaaatg 
7740 



attgattatg aaaccttcaa agaactgtgt 
7800 



atcacgggat cagatggctc caccagggtg 
7860 



tatgatattc aagatgctat tgacaagggc 
7920 

cgatccggca gcctcagcct cactcaattt 
7980 



ggcaccagca gcagcatggg cagtggtgtc 
8040 



gaatcagtaa gtaagatttc caccatatcc 
8100 



tctttttcag acaccctgga ayaatcgagc 
8160 



ctgctcttgc tttcagtaca gaagagaagc 
accgtcactg agctagtaga ttctggtata 
tctggtcaga tttcttatga cgaggttggt 
agctgcatag caggcatata caatgagacc 
atgaaaattg gcttagtccg acctggtact 
actggcttta tagtggatcc tgttagcaac 
agaggtctgg tgggcattga gttcaaagag 
gggtataatg atcctgaaac aggaaacatc 
ctcatcgaaa agggccacgg tattcgctta 
attgacccaa aggagagcca tcgtttacca 
aatgaggaac tcagtgagat tctctcagat 
cccaacactg aagaaaatct tacctatctg 
gaaacagggc tctgtcttct gcctctgaaa 
aagaataccc tcaggaagcg tagagtggtc 
tctgttcagg aggcctacaa gaagggccta 
gagcaggaat gtgaatggga agaaataacc 
gtcctggtag atagaaagac aggcagtcag 
cttgttgaca ggaagttctt tgatcagtac 
gctgacatga tctccttgaa aaatggtgtc 
agcgatgatg tttttagcag ctcccgacat 
agcgtcagga atttaaccat aaqgagcagc 
cccattgcag ccatctttga cacagaaaac 
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ctggagaaaa tctccattac agaaggtata gagcggggca tcqttgacag catcacgggt 
8220 

cagaggcttc tggaggctca ggcctgcaca ggtggcatca tccacccaac cacgggccag 
8280 

aagctgtcac ttcaggacgc agtctcccag ggtgtgattg accaagacat ggccaccagc 
8340 

gtgaagcctg ctcagaaagc cttcataggc ttcgagggtg tgaagggaaa gaagaagatg 
8400 

tcagcagcag aggcagtgaa agaaaaatgg ctcccgtatg aggctggcca gcgcttcctg 
8460 

gagttccagt acctcacggg aggtcttgtt gacccggaag tgoatgggag gataagcacc 
8520 

gaagaagcca tccggaaggg gttcatagat ggccgcgccg cacagaggct gcaagacacc 
8580 

agcagctatg ccaaaatcct gacctgcccc aaaaccaaat taaaaatatc ctataaggat 
8640 

gccataaatc gctccatggt agaagatatc actgggctgc gccttctgga agccgcctcc 
8700 

gtgtcgtcca agggcttacc cagcccttac aacatgtctt cggctccggg gtcccgctcc 
8760 

ggctcccgct cgggatctcg ctccggatct cgctccgggt cccgcagtgg gtcccggaga 
8820 

ggaagctttg acgccacagg gaattcttcc tactcttatt cctactcatt tagcagtagt 
8880 

tctattgggc actagtagtc agttgggagt ggttgctata ccttgacttc atttatatga 
8940 

atttccactt tattaaataa tagaaaagaa aatcccggtg cttgcagtag agtgatagga 
9000 

cattctatgc ttacagaaaa tatagccatg attgaaatca aatagtaaag gctgttctgg 
9060 

ctttttatct tcttagctca tcttaaataa gcagtacact tggatgcagt gcgtctgaag 
9120 

tgctaatcag ttgtaacaat agcacaaatc gaacttagga tttgtttctt ctcttctgtg 
9180 

tttcgatttt tgatcaattc tttaattttg gaagcctata atacagtttt ctattcttgg 
9240 

agataaaaat taaatggatc actgatattt tagtcattct gcttctcatc taaatatttc 
9300 

catattctgt attaggagaa aattaccctc ccagcaccag cccccctctc aaacccccaa 
9360 

cccaaaacca agcattttgg aatgagtctc ctttagtltc agagtgtgga ttgtataacc 
9420 " ' 

catatactct tcgatgtact tgtttggttt ggtattaatt tgactgtgca tgacagcggc 
9480 
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aatcttttct ttggtcaaag ttttctgttt attttgcttg tcatattcga tgtactttaa 
9540 



ggtgtcttta tgaagtttgc tattctggca ataaactttt agactttt 
9588 



<210> 


33 


<211> 


366 


<212> 


DNA 


<213> 


human 


<220> 




<221> 


misc feature 


<222> 


(351).. (351) 


<223> 


any kind of 1 


<400> 


33 



gaagtgccat ttatatttat acaaaaatat tacataattc agttagliatt ggtgacataa 
60 



tttagttagt atgggtgata taatggtcat aatttttagc atctaataaa gatcttttta 
120 



tgagtcccat ataaaatatg tgaacaaagc aatcttgtca taagatttgt gatgatttag 
180 



gagaaagtac tttgagataa tttttttctg tctctttgtg aactctctca acagtagttc 
240 



tctttagatt agagccagca ggtcggccat aacagttttc ttcaaatttg ggcaacagtt 
300 

atacaaatgc ttgaatttca agacaacata ttaaagggtc tatgaactgg naatctaacc 
360 



tgggtt 
366 



<210> 34 

<211> 1466 

<212> DNA 

<213> human 

<400> 34 

agccccaagc ttaccacctg cacccggaga gctigtgtgtc accatgtggg tcccggttgt 
60 

cttcctcacc ctgtccgtga cgtggattgg tgctgcaccc ctcatcctgt ctcggattgt 
120 



gggaggctgg gagtgcgaga agcattccca accctggcag gtgcttgtgg cctctcgtgg 
180 



cagggcagtc tgcggcggtg ttctggtgca cccccagtgg gtcctcacag ctgcccactg 
24 0 



catcaggaac aaaagcgtga tcttgctggg tcggcacagc ctgtttcatc ctgaagacac 
300 



aggccaggta tttcaggtca gccacagctt cccacacccg ctctacgata tgagcctcct 
360 
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gaagaatcga ttcctcaggc caggtgatga ctccagccac gacctcatgc tgctccgcct 
420 

gtcagagcct gccgagctca cggatgctgt gaaggtcatg gacctgccca cccaggagcc 
480 



agcactgggg accacctgct acgcctcagg ctggggcagc attgaaccag aggagttctt 
540 



gaccccaaag aaacttcagt gtgtggacct ccatgttatt tccaatgacg tgtqtgcgca 
600 



agttcaccct cagaaggtga ccaagttcat gctgtgtgct ggacgctgga cagggggcaa 
660 



aagcacctgc tcgggtgatt ctgggggccc acttgtctgt aatggtgtgc ttcaaggtat 
720 



cacgtcatgg ggcagtgaac catgtgccct gcccgaaagg ccttccctgt acaccaaggt 
780 



ggtgcattac cggaagtgga tcaaggacac catcgtggcc aacccctgag cacccctatc 
840 



aaccccctat tgtagtaaac ttggaacctt ggaaatgacc aggccaagac tcaagcctcc 
900 



ccagttctac tgacctttgt ccttaggtgt gaggtccagg gttgctagga aaagaaatca 
960 



gcagacacag gtgtagacca gagtgtttct taaatggtgt aattttgtcc tctctgtgtc 
1020 



ctggggaata ctggccatgc ctggagacat atcactcaat ttctctgagg acacagatag 
1080 



gatggggtgt ctgtgttatt tgtggggtac agagatgaaa gaggggtggg atccacactg 
1140 



agagagtgga gagtgacatg tgctggacac tgtccatgaa gcactgagca gaagctggag 
1200 



gcacaacgca ccagacactc acagcaagga tggagctgaa aacataaccc actctgtcct ■ 
1260 



ggaggcactg ggaagcctag agaaggctgt gagccaagga gggagggtct tcctttggca 
1320 



tgggatgggg atgaagtaag gagagggact ggaccccctg gaagctgatt cactatgggg 
1380 



ggaggtgtat tgaagtcctc cagacaaccc tcagatttga tgatttccta gtagaactca 
1440 



cagaaataaa gagctgttat actgtg 
1466 



<210> 35 

<211> 187 

<212> DNA 

<213> human 

<400> 35 

gatctggtgc attccggtcg acactctcgt ttatttggac tgtaagtctg acctctatga 
60 
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ataattactt cagcccctga gtgctcccgg gccaagctcc ttggccaaac tttcacctta 
120 

gcttctgata agtcttgggc caagctaagc agcatctatc aatcatccct tcagctcctg 
180 

attgatc 
187 



<210> 36 

<211> 2913 

<212> DNA 

<213> human 

<400> 36 

actgggtacc gaggactggg tgtgtttaag gcagacagcc aggtgaggat cccagctact 
60 

ggggcctgct gtcatctcct gggagtaccc gggggtcagg agcctagggg actcttgcac 
120 

ttcacatcca gccatgctaa ttacactttt tggcaaagga aacagctagg agcagtttct 
180 

ttcactccta cagccccgtt ttctcagtgt ttagacctcg aattattact gggctagagg 
240 

gaaggcagcc tctgaagtgt qgcaggagga ggggaagtct gcctgcatct tggtgtgtct 
300 

gtcagatgcc agcactaata acctggcttc tgtgaggcct gtcagtgctc tcaggaatga 
360 

aaggggaccc ctgagaggtg ctcagtacca gcaggctgtg aatgctctct acccaccacc 
420 

ctcacctcct cgttaaagat ggtgctacct gccacacagc agacatctgg tcgctgcaca 
480 

cccgaaagac cccaaggcag tctgcccctt gtccagccac acgccagcac ccaccctcct 
540 

ggcccctgcc tcggcctccc cagaccagct gcacccagcc cccaacacgc accccttctc 
600 

cagatgtgtg cagggcctca ttttgcagag caaagacaga tgtttcagcc acacgcttta 
660 

ttaacttcta aaacctgtgc tcaggacact cttcaacagt catgaaaagt ttgatcactt 
720 

gccacagtca ggacctttgt gtggggctct gatctgatgt tcgglctcat catctcccaa 
780 

accagcagtc gtttgtaccc caaccctctg ctcaggggct cataccccca aatgattttc 
84 0 

ctgatttatg tatttcccta caaagggctt tctataccta gcatctgcct ccagcatgag 
900 

aagggggaat aggtgagacc catttgccag tagcagacgg ggaccctggg gagaaaatgg 
960 

cagagcctgt tggagactcc ctgtctccag ctgaccagcc aatgggattc ctcttccctc 
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1020 

cactgtctcc cacaaagtag aagaatcctg 
1080 

ccctatctag acatgaggcc ctttagacat 
1140 

gggtcgggga ggcagagggg atgctcacac 
1200 

tcacctgggg agagttcaca aaatactggt 
1260 

ggtgggtctg gggtgtggcc caggcatcat 
1320 

gcagcccagc taagccccta gagccttgca 
1380 

tttgagggaa atgcctaact tcaggggccg 
1440 

agataccagg cccaccccca gagatgagct 
1500 

agtgtttttc acaagctcca tacctccagg 
1560 

cattctgaga gtcctgttgc ctgtgccttg 
1620 

gccttgagga ggatgtgcat taacgtggta 
1680 

tgtcccaccg gggacctcca aaaacttcat 
1740 

agaggatgag gctggcggat ttagtaagag 
1800 

agttgccctg tcatctggcc tctggataac 
1860 

gatggcgaaa gacggccaca tttagtgaga 
1920 

acagtggccc ctggtgcatg gaccacacac 
1980 

tctgaaatta gggagatatg aatgtctttc 
2040 

ttgcttgggg gtccttggtc aaggccagct 
2100 

gtctcatttt gctgtactgc aaactcaggc 
2160 

tcccctagta gggtttgttt tggggtcaca 
2220 

cagcctctac atcagggaga gaggtaggta 
2280 

ctaaagtaca acagccttgg aggaactgcc 



gtacatttag cccatgagcc tggcacagat 
gactttggca ttgaccagcc tgttggcaat 
cagtaattct catcccctga atgcttggga 
gcaggggtcc cacctctgat gatgctgagt 
gatgtttcag gcccccaggt gacttcttag 
atttccccca aatgacctca gagggcccga 
taagaatccc ccagggagca tgtgaaatgc 
gaggtgggtc aggggtgaag tgcagggatc 
aaatggtgtt gtggttgggc ccgtagaaaa 
gtgcacgtgg ggtggaatcc cagtggccct 
ggggagacag agacagctcc acctgccccc 
ggatgttaga gcaagcagcc atgctgcagc 
ccctctgtgt ttgggctgag ttctttctct 
ccacctctcc tccctcatcc taaaattaca 
cccctaaggt cctccaacta gggtgggtcc 
tctcttccct cctctggctc aggactacgg 
ttgaaaactt ctcttcccag tcttcccact 
ttggactagg gcttgttgcg actaccagct 
ttggttccaa gcttatgggg gccctgtcct 
tctggtcata cccttcagag agctcttccc 
gggaggagca ttcaaggatt agaagaagga 
aggaactaag ggcgagcact ggagaaggca 
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2340 



acctgggacc ccctgcgctt ctgagcagga agaccaagac cttcaggggc cctaagcact 
2400 



gaaaacatca ttcctcatcc ccaagccctg gcatccccct gttcttctaa aataattctt 
24 60 



ttctaggtat ttctgattgc aaaattctgg atgggttcat ccaagctgac ctttgctgtt 
2520 



ttttcccttc ccaacaaggc ctcacttttt ggagccacct tagttggtgc ctaggcagag 
2580 



gggcagtcag cagtggltat caggatcctg gctctatygg ttgccttcct cctggtctgt 
2640 



aaagcccctg caggcaggga cttcttagat agctgcttcc ttagggcatg gcatgtggtg 
2700 



ggtggttaat gaatggaaga gagggaatga gtgatcaagg gagggaggag ggagtggagt 
2760 



ggagatttct catcctttcc tgttaattta tgacatcctc ctgcctatga gtccttgact 
2820 



ctggagtttt acaaagcagt cacatttcaa ataaaagtct gggaaagcaa cacatcatcg 
2880 



ccaactttta attttgctaa ataaggatat tag 
2913 



<210> 37 

<211> 3466 

<212> DNA 

<213> human 

<400> 37 

agccccaagc ttaccacctg cacccggaga gctgtgtgtc accatgtggg tcccggttgt 
60 

cttcctcacc ctgtccgtga cgtggattgg tgctgcaccc ctcatcctgt ctcggattgt 
120 



gggaggctgg gagtgcgaga agcattccca accctggcag gtgcttgtgg cctctcgtgg 
180 



cagggcagtc tgcggcggtg ttctggtgca cccccagtgg gtcctcacag ctgcccactg 
240 



catcaggaac aaaagcgtga tcttgctggg tcggcacagc ctgtttcatc ctgaagacac 
300 



aggccaggta tttcaggtca gccacagctt cccacacccg ctctacgata tgagcctcct 
360 



gaagaatcga ttcctcaggc caggtgatga ctccagccac gacctcatgc tgctccgcct 
420 



gtcagagcct gccgagctca cggatgctgt gaaggtcatg gacctgccca cccaggagcc 
480 

agcactgggg accacctgct acgcctcagg ctggggcagc attgaaccag aggagttctt 
540 
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gaccccaaag aaacttcagt gtgtggacct 
600 

agttcaccct cagaaggtga ccaagttcat 
660 

aagcacctgc tcgggtgatt ctgggggccc 
720 

cacgtcatgg ggcagtgaac catgtgccct 
780 

ggtgcattac cggaagtgga tcaaggacac 
840 

aaccccctat tgtagtaaac ttggaacctt 
900 

ccagttctac tgacctttgt ccttaggtgt 
960 

gcagacacag gtgtagacca gagtgtttct 
1020 

ctggggaata ctggccatgc ctggaqacat 
1080 

gatggggtgt ctgtgttatt tgtggggtac 
1140 

agagagtgga gagtgacatg tgctggacac 
1200 

gcacaacgca ccagacactc acagcaagga 
1260 

ggaggcactg ggaagcctag agaaggctgt 
1320 

tgggatgggg atgaagtaag gagagggact 
1380 

ggaggtgtat tgaagtcctc cagacaaccc 
1440 

cagaaataaa gagctgttat actgtg 
1466 



<210> 38 

<211> 462 

<212> DNA 

<213> human 

<220> 

<221> misc_feature 

<222> (197) . . (197) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (116) . . (116) 

<223> any kind of base 



55 149 A2 

ccatgttatt tccaatgacg tgtgtgcgca 

gctgtgtgct ggacgctgga cagggggcaa 

acttgtctgt aatggtgtgc ttcaaggtat 

gcccgaaagg ccttccctgt acaccaaggt 

catcgtggcc aacccctgag cacccctatc 

ggaaatgacc aggccaagac tcaagcctcc 

gaggtccagg gttgctagga aaagaaatca 

taaatggtgt aattttgtcc tctctgtgtc 

atcactcaat ttctctgagg acacagatag 

agagatgaaa gaggggtggg atccacactg 

tgtccatgaa gcactgagca gaagctggag 

tggagctgaa aacataaccc actctgtcct 

gagccaagga gggagggtct tcctttggca 

ggaccccctg gaagctgatt cactaLgggg 

tcagatttga tgatttccta gtagaactca 
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<220> 

<221> misc_feature 

<222> {334) . . (334) 

<223> any kind of base 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



<220> 

<221> misc_feature 

<222> (402) (402) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (429) . . (429) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (438).. (430) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (443) (443) 

<223> any kind of base 



<220> 

<221> raisc_feature 

<222> (459) . . (459) 

<223> any kind of base 



<400> 38 

taaggtttta 

60 

ttattttcag 
120 

aaaaaagagt 
180 

catgtgcaca 
240 

tccagtaact 
300 

atttttcata 
360 

atccctgggg 
420 

tcacagggna 
462 



taattatttt tatttttctt 



atccaatact 
tgtatttttt 
acgcagnggt 
cgtcatttaa 
gcttaaaaat 
gaataaattt 
aaaggganat 



agaagttgtt 
ttttttgctt 
tagctacata 
cattagglat 
cattgacata 
tgtcttaaca 
ccncccattt 



ttcltttttt tttatggctt ggatgacact 
tccatgttca cattttcctt cctggnttaa 
tttttaaatt atactttaag ttttagggta 
tgtatacatg tgccatgttg gcgtgctgca 
atctccaaat gctatccttc cccccattgt 
ggantaattc caactaaagt acggtattaa 
agggtaaggt tngtgaaaag gatggttttg 
taaaacccnc ct 



<210> 39 

<211> 1490 

<212> DMA 

<213> human 
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<400> 39 

ctcgtgcccc ccacggagqq gactgctctc 
60 

acccacctca gcacctgaga gggtgaaata 
120 

tttgagaagt ctcggtcggt aagggaagtc 
180 

cacctgcctc ctcttcggcc accccccaga 
240 

cacgactctg accatagtct tctctcagct 
300 

gttctgtgaa caagtccatg ctgccatcaa 
360 

aaaggatcag gggatcaccc tgagaaagct 
420 

tggcatggct cagctcatgg aagtactttc 
480 

tgaccttatt tcctacaaca gtgtctgggt 
540 

agataacaaa gctgcagctc ttttgatgct 
600 

acatgaagaa atggagcagg ctgtggaaga 
660 

tactgaggag aacaactctg acaaccacaa 
720 

caatcaggac ttgtattggt cagaggacga 
780 

ggtgagagca tccaaagcct gcctgaagaa 
840 

gaaggatcag gtggcacagc tggatgacat 
900 

tgtggatgat ttggctctga gcatatatcc 
960 

ttctgcgaaa cttgtatctg ttttaaagaa 
1020 

gacccctcag ccagaagata gttggatccc 
1080 

gaatagaatc aaggagctca ctcagagtga 
1140 

tactctcttc ccctctcatc gtcatggtca 
1200 



agaatgcttg ctggattgaa agggagtgcc 
1260 

aaagattgll ggttaggcca gattgacacc 



55149 A2 

ccccgctgca tcctttctgt gaggtacctt 

gaattctaac ctcgacattc gggaagtgtt 

ttccaagtcc gtgcagcact aacgtattgg 

tgaggcagct gtgactgtgt caagggaagc 

tcc.actgccg tctccacagg aaacccagaa 

ggcatttatt gcagtgtact atttgcttcc 

ggtacggggc gccaccctgg acatcgtgga 

cgtcactcca actcagagcc ctgagaacaa 

tgcgtgccag cagatgcctc agataccaag 

gaccaagaat gtggattttg tgaaggatgc 

atgtgaccct tactctggcc tcttgaatga 

tcatgaggat gatgtgttgg ggtttcccag 

tcaagagctc ataatcccat gccttgcgct 

aattcggatg ttagtggcag agaatgggaa 

tgtggatatt tctgatgaaa tcagccctag 

acctatgtgt cacctgaccg tgcgaatcaa 

ggcacttgaa attacaaaag caagtcatgt 

tttacttatt aatgccattg atcattgcat 

acttgaatta tgacttttca ggctcatLtg 

ggctctgata cctgctttta aaatggagct 

tatctatatt tagcaagaga cactattacc 

tatttataaa ccatatgcgt atatttttct 
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1320 



gtgctatata tgaaaaataa ttgcatgatt tctcattcct qagtcatttc tcagagattc 
1380 



ctaggaaagc tgccttattc tctttttgca gtaaagtatg ttgttltcat tgtaaagatg 
1440 



ttgatggtct caataaaatg ctaacttgcc agtgattaaa aaaaaaaaaa 
1490 



<210> -40 

<211> 1677 

<212> DNA 

<213> human 

<400> 40 

cttgacccta tttatagtgg ctctaaaggt ggtgttatta tgttttctag agcacttcga 
60 



ttatacaaac gtcaaggaat ccgagttaat gtgctttgcc ctgagtttgt tgaaacagac 
120 



atgggcacaa tgatcggtcc caaattcctt agtatgatgg ggggctttgt acctatggaa 
180 



atggtggtga aaggtgcttt tgagctcatc actgatgaga ataaagccgg cgattgccta 
24 0 



tggattacta atcggcgagg tcttgagtac tggcccaccc catcagaaga agcaaagtac 
300 



ttgctgcgtt ctacacgttc caggagaaga actgaataca aagctccacc aattaaacta 
360 

cctgagagtt ttgagaaaat agttgttcag accttgactc acaactttcg gaatgctacc 
420 



agtgtagtaa gagcaccact gagattacct atcaaaccaa actatgttct tgtgaagata 
480 

atctatgctg gtgtaaatgc tagtgatgta aattttagct caggtcgcta ttttggtggc 
540 

aataacagtg acactgcatc ccgtcttccg tttgatgcag gatttgaggc tgtgggagta 
600 

attgcagcag ttggggattc tgttactgac ttgaaagttg gcatgccttg tgcgttcatg 
660 

acttttggag gctatgctga atttacaatg attccttcga aatacgccct tccaatgcct 
720 



agaccagaac cggaaggtgt tgccatgctt acatcaggat taacagcttc aattgctcta 
780 



gaaaaggcag gacagatgga atctggaaaa gtggtccttg ttactgctgc ggcaggagga 
84 0 



actggtcagt ttgctgttca gcttgcaaaa ttagctggta ataccgtggt tgccacttgt 
900 



ggaggtgggg caaaggccaa gctCctgaaa gaattgggag tcgacagagt catagactat 
960 
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cacagtgaag atataaaaac 
1020 

tacgaatotg ttggtgggga 
1080 

cgactcattg tcattggcat 
1140 

aaatatcctg gactatgtga 
1200 

ctggtgcaat atagtcacat 
1260 

tccggaaaac taaaggttgc 
1320 

gatgctgttg agtatctcca 
1380 

ccgaccttcg gLcatcaagt 
1440 

agtgaagttt tcaattctta 
1500 

agaccagtgc tggaatattt 
1560 

aatccattta tgtataccat 
1620 

gcgagatatc Lacaaaataa 
1677 



<210> 41 

<211> 1330 

<212> DNA 

<213> human 

<400>- 41 

atggcgcagt gggacagctt 
60 

gtgaagtttg atgctcgctc 
120 

ccacttcaag aaaagctgaa 
180 

tttgctgttc tcattcctat 
240 

aagaattgca cagttggttc 
300 

ggaaatgaca gtgaagatga 
360 

atggagaaaa gaatccaata 
420 

tttcaaaatt tcagtgtgac 
480 
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ggttctaagg aaagagttcc 
catgttaaag ttgtgcttgg 
gatttctcag Latcaaggag 
gaagctcttg tcaaagagtc 
gtaccaagaa caccttaaca 
tgtggatcca aagagattta 
ttcaggcaaa agcgttggga 
agccaaatta tgaatgaaca 
gtctagagat tgttctcgaa 
attctcaatg ctttttcaat 
gtttatgttt acactataca 
attataatcc tttcatttta 

cactgatcaa caggaggaca 
caatacagct ttgcttcccc 
atccttcaaa gctgcactga 
catcgcaata atggcagctc 
aattaatgca aacagtgtat 
agtgagattt cgagaagttg 
latttcagat actgaagaaa 
aactgatcaa cgatttgctg 



cgaaaggtat tgatatcatc 
atgctttggc agtccatgga 
aaaatggttg gacgccatca 
aaactgtggc tggctttttc 
agttatttga cctttactct 
taggccttca ttctgttgct 
aggtggttgt ctgcgtggac 
cggtgtcaaa tacagaaaga 
tgttactgaa aatagctgct 
tttggattac ttgaaagaat 
acaactatga gcagaagaaa 
aaaaaaaaaa aaaaaaa 

ctgatagctg ttcagaatct 
caaatcctaa aaatggccct 
ttgcccttta tctccttgtg 
aactcctgaa gtgggaaatg 
cctccagtct cctgggaaga 
ttatggaaca cattagcaag 
atctcgtaga ttcagagcat 
atgttcttct ccaactaagt 
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accttggttc ccacagtcca gggacatggg aatgccgtag atgaaatcac caggtcctta 
540 

ataagtctga ataccacgct gcttgatttg cacctctatg tagaaacact gaatgtcaaa 
600 

ttccaggaga atacacttaa agggcaagag gaaatcagca aattaaagga gcgtgtgcac 
660 

aatgcatcag cagaaattat gtctatgaaa gaagaacaag tgcatttgga acaggaaata 
720 

aaaagagaag tgaaagtcct gaataacatc actaatgatc tcaggctgaa agattgggaa 
780 

cattctcaga cgttgagaaa tatcacttta attcaaggtc ctcctggacc cccaggagaa 
840 

aaaggagata gaggtccaac cggagaaagt ggtccaccag gcgttccagg tccagtaggt 
900 

cctccaggtc ttaagggtga tcgaggatct attggctttc cgggaagtcg aggatatcca 
960 

ggacaatcag ggaagactgg gaggacagga tatcctggac caaaaggcca aaaggqaqaa 
1020 

aaaggcagtg gaagcatcct gactccttct gcgactgtcc gactggttgg Lggccgtggc 
1080 

cctcatgagg gtagagtgga gatattgcac aatggacagt ggggcacagt ttgtgatgat 
1140 

cactgggaac tgcgtgccgg gcaggttgtc tgcaggagct tgggataccg aggtgttaag 
1200 

agtgtgcaca agaaagctta ttttggacaa ggtactggtc ccatttggct gaatgaagta 
1260 

ccctgtttgg ggatggagtc atccattgaa gagtgcaaaa tcagacagtg gggcgtgaga 
1320 

gtctgttcac 
1330 



<210> 42 

<211> 431 

<212> DNA 

<213> human 

<220> 

<221> misc_feature 

<222> (97).. (97) 

<223> any kind of base 



<220> 

<221> misc_feature 

<22l> (347) . . (347) 

<223> any kind of base 



<220> 

<221> misc feature 
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<222> (349) . . (349) 
<223> any kind of base 



5 



<220> 



<221> misc_feature 
<222> (361).. (361) 
<223> any kind of base 



to 



<220> 

<221> misc_feature 

<222> (362) (362) 

<223> any kind of base 



15 



<220> 

<221> misc_feature 

<222> (363) (363) 

<223> any kind of base 



20 



<22Q> 

<221> misc_feature 

<222> (401) . . (401) 

<223> any kind of base 



25 



<220> 



<221> misc_feature 
<222> (428) . . (428) 
<223> any kind of base 



<400> 42 

ctttttatat ttattttcat cgctacacaa acatt.tttta ggagtttgat tctacctcca 
60 

ttttggttag atatacaaac tctaccccat gagggantgt atggtgtatt tctagattta 
120 

gcaacaattt tcttgaaaaa tgtacaatac tatagaaaaa tgaagatagt aaataccagg 
180 

tataagttaa taacagtgtt tcttttgttc agtaataatg aactgtgtac tagcactgaa 
240 

ctttaggccc tcctatttgc gtattttctg tttgtatatt tttaaataga ggaattgtga 
300 . 

ttataatatt attattttgg aatatcctaa atcataaatt caaaacntna tttagttttt 
360 

nnnttttttt tttaagatgg agtcccgctt tgtcccaggc nggagtgcag tggcatgatc 
420 

tcagctcnct g 



<210> 43 

<211> 669 

<212> DNA 

<213> human 

<220> 



431 



50 
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<221> misc_feature 
<222> (641) . . (641) 
<223> any kind of base 



<4 00> 4 3 

ttcttttgga aaaccaaaca tgctttattt catttttttc acaatttatt taaacatctc 
60 

acatatacaa aataggtaca atttaatttt tctgcttgcc caagaaacaa agcttctgtg 
120 

gaaccatgga agaagatgaa aatgagactg gcaaagaaca aatgctgaat ctgaagaaga 
180 



ggacaacttt gggcaaataa tctgcatact tttaattggg aataagatgg aaaatatgaa 
240 

tgctaaatca aattttttaa aaaatacacc acacgataca actcaataca ggagtatttc 
300 

ttctcaaatt cttctagcac catcaacatt cttcaagtat ctgaaatact attaattagc 
360 

acctttgtat tatgaacaaa acaaaacaag gacctcagtt' catctctgtc taggtcagca 
420 

cctaacaatg tggatcacac tcatgggaaa gtgttttgag gtagtttaaa cctttggaag 
480 

tttgggtttt aaacttccct ctgtggaaga tattcaaaag ccacaagtgg tgcaaatgtt 
540 

tatggttttt atttttcaat ttttattttg gttttcttac aaaggttgac atttttcata 
600 

acaggtgtaa gagtgttgaa aaaaaaattt caatttttgg ngggaacggg ggaaggagtt 
660 



aatgaaact 
669 



<210> 44 

<211> 287 

<212> DNA 

<213> human 

<400> 44 

gccggagagt ctacaatgtt acccagcatg ctgttggcat tgttqtaaac aaacaagtta 
60 

i 

agggcaagat tcttgccaag agaattaatg tgcgtattga gcacattaag cactctaaga 
120 



gccgagatag cttcctgaaa cgtgtgaagg aaaatgatca gaaaaagaaa gaagccaaag 
180 



agaaaggtac ctgggttcaa ctaaagcgcc acgctgctcc acccagagaa gcacactttg 
240 



tgagaaccaa tgggaaggag cctgagctgc tggaacctat tccctat 
287 



<210> 45 
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<211> 383 

<212> DNA 

<213> human 

<220> 

<221> misc_feature 

<222> (147) . . (147) 

<223> any kind of base 



<220> 

<221> misc__feature 

<222> (309) . . (309) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (349) . . (349) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (365).. (365) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (372) . . (372) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (380) . - (380) 

<223> any kind of base 



<400> 45 

ggaacggaaa aggagaattc aagtgtgacc ctcatgaggc aacgtgttat gatgatggga 
60 

agacatacca cgtaggagaa cagtggcaga aggaatatct cggtgccatt tgctcctgca 
120 

catgctttgg aggccagcgg ggctcgnctt gtgacaactg ccgcagacct ggggggtgaa 
180 

cccagtcccg aaggcactac tggccagtcc tacaaccagt attcttcaga gataccattc 
240 

agagaacaaa cactaatgtt taatttgccc aatttgagtg cttcatgcct tttaggatgt 
300 

tacaggctng acagagaagg ttttcccgag gagttaaatc atctttttnc catttcccga 
360 

ggggncaagg cntgtttttn ttt 
383 



<210> 46 
<211> 523 
<212> DNA 



99 



EP 1 355 149 A2 



<213> human 
<400> 46 

cagaggggca gggcggacgg ctaggagttc aagaaacatc ctggtctgag ggaaaggctg 
60 

cagctgcacc gccatgaata agcttttcag cttctggaag aggaagaatg agacccgcag 
120 

ccagggctac aaccttcgag aaaaggattt aaagaaactt cacagagctg cttcagtcgg 
180 

ggatttgaag aagctgaagg aataccttca gatcaagaaa tatgatgtaa atatgcagga 
240 

ctatgaatac agaacacctt tgcacctagc ctgtgctaat ggacatacag atgttgtact 
300 

tttcctaatt gagcaacaat gcaagataaa tgtccgggat agtgaaaaca aatccccatt 
360 J 

gattaaggca gtacagtgtc aaaatgagga ttgtgctact attctgctaa actttggtgc 
420 

agacccagat ctgagggata ttcgttataa tactgttctt cactatgctg tttgtggtca 
4 80 

aagtttgtca ttagttgaaa aactgcttga atacgaagct gat 
523 



<210> 47 

<211> 390 

<212> DNA 

<213> human 

<400> 47 

tccaaggtca tggcaaaaca tctgaagttc atcgccagga ctgtgatggt acaggaaggg 
60 

aacgtggaaa gcgcatacag gaccctaaac agaatcctca ctatggatgg gctcattgag 
120 

gacattaagc atcggcggta ttatgagaag ccatgccgcc gcgacagagg gaaagctatg 
180 

aaaggtgccg gcggatctac aacatggaaa tggctcgcaa gatcaacttc ttgatgcgaa 
240 

agaatcgggc agatccgtgg cagggctgct gaygcctgtg ggtgggacac cagtgcgaaa 
300 

ccctcatcca gttttctctc catctctttt ctttgtacaa tcccatttcc tattaccatt 
360 

ctctgcaata aactcaaatc acatgtctgc 
390 



<210> 48 

<211> 669 

<212> DNA 

<213> human 

<220> 

<221> misc feature 
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<222> (641) (641) 
<223> any kind of base 



ttcttttgga aaaccaaaca tgctttattt catttttttc acaatttatt taaacatctc 
60 

acatatacaa aataggtaca atttaatttt tctgcttgcc caagaaacaa agcttctgtg 
120 

gaaccatgga agaagatgaa aatgagactg gcaaagaaca aatgctgaat ctgaagaaga 
180 

ggacaacttt gggcaaataa tctgcatact tttaattggg aataagatgg aaaatatgaa 
240 

tgctaaatca aattttttaa aaaatacacc acacgataca actcaataca ggagtatttc 
300 

ttctcaaatt cttctagcac catcaacatt cttcaagtat ctgaaatact attaattagc 
360 

acctttgtat tatgaacaaa acaaaacaag gacctcagtt catctctgtc taggtcagca 
420 

cctaacaatg tggatcacac tcatgggaaa gtgttttgag gtagtttaaa cctttggaag 
480 

tttgggtttt aaacttccct ctgtggaaga tattcaaaag ccacaagtgg tgcaaatgtt 
540 

tatggttttt atttttcaat ttttattttg gttttcttac aaaggttgac atttttcata 
600 

acaggtgtaa gagtgttgaa aaaaaaattt caatttttgg ngggaacggg ggaaggagtt 
660 

aatgaaact 
669 



<210> 49 

<211> 431 

<212> DNA 

<213> human 

<220> 

<221> misc_feature 

<222> (97).. (97) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (347) . . (347) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (349) . . (349) 

<223> any kind of base 



<220> 
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<221> misc_feature 
<222> (361)-. (361) 
<223> any kind of base 



<220> 

<221> misc_feature 

<222> (362) (362) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (363).. (363) 

<223> any kind of base 



<220> 

<221> misc_feature 

<222> (401) (401) 

<223> any kind of base 



<220> 

<221> raisc_feature 

<222> (428) . . (428) 

<223> any kind of base 



ctttttatJt ttattttcat cgctacacaa acatttttta ggagtttgat tctacctcca 
60 

ttttggttag atatacaaac tctaccccat gagggantgt atggtgtatt tctagattta 
120 

gcaacaattt tcttgaaaaa tgtacaatac tatagaaaaa tgaagatagt aaataccagg 
180 

tataagttaa taacagtgtt tcttttgttc agtaataatg aactgtgtac tagcactgaa 
240 

ctttaggccc tcctatttgc gtattttctg tttgtatatt tttaaataga ggaattgtga 
300 

ttataatatt attattttgg aatatcctaa atcataaatt caaaacntna tttagttttt 
360 . 

nnnttttttt tttaagatgg agtcccgctt tgtcccaggc nggagtgcag tggcatgatc 
420 , : 

tcagctcnct g 
431 



Claims 

1 A method of assessing colorectal cancer status comprising identifying differential modulation of each gene (relative 
to the expression of the same genes in a normal population) in a combination of genes selected from the group 
consisting of Seq. ID. No. 42-49. 
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2. The method of claim 1 wherein there is at least a 2 fold difference in the expression of the modulated genes. 

3. The method of claim 1 wherein the p-value indicating differential modulation is less than .05. 

4. The method of claim 1 further comprising employing a colorectal diagnostic that is not genetically based. 

7 ' 2neSerel!° ^ * ^ 6 8 *** " ,he differential - ■» 0— con- 

8. The diagnostic portfolio of claim 7 wherein said matrix is employed in a microarray. 

9. The diagnostic portfolio of daim 8 wherein said microarray is a cDNA microarray. 

10. The diagnostic portfolio of claim 8 wherein said microarray is an oligonucleotide microarray. 

13. The kit of claim 12 further comprising reagents for conducting a microarray analysis. 

14 -orX^ 

1S " t^nTS.K ° f C ' airn 15 Wherein the assessment ^ response to therapy includes a determination of whether 
the patent is improvmg, not improving, relapsing, likely to improve, or likely to relapse. 

1? ' ^llTT ? C ° ,0r ! Ctal CanC6r Sta,UtS COmprisin 9 isolated nuctei0 ^id sequences, their complements 
or portions thereof of a combination of genes selected from the group consisting of Seq. ID No 42 49 

^iSE^^ their 
complements, or port,ons thereof of a combination of genes selected from the group consisting of Seq V No 
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